Field | r0 | r1 | r2 | r3 |
Experiment Name | K-Means scalability icpx-O3-aggressive 100000000 | K-Means scalability icpx-O3-aggressive 100000000 | K-Means scalability acfl-O3-all 100000000 | K-Means scalability gcc-O3-funroll 100000000 |
Application | ./kmeans/kmeans-icpx-O3-aggressive | same as r0 | ./kmeans/kmeans-acfl-O3-all | ./kmeans/kmeans-gcc-O3-funroll |
Timestamp | 2025-06-25 14:19:27 | 2025-06-23 16:14:50 | 2025-07-07 09:20:12 | 2025-06-24 09:38:07 |
Experiment Type | OpenMP | same as r0 | same as r0 | same as r0 |
Machine | otterfall | skylake | ip-172-31-18-66 | ip-172-31-47-249.ec2.internal |
Architecture | x86_64 | same as r0 | aarch64 | same as r2 |
Micro Architecture | SKYLAKE | same as r0 | ARM_NEOVERSE_V1 | ARM_NEOVERSE_V2 |
Model Name | Intel(R) Xeon(R) Silver 4210R CPU @ 2.40GHz | Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz | | |
Cache Size | 14080 KB | 36608 KB | | |
Number of Cores | 10 | 26 | | |
Maximal Frequency | 3.2 GHz | 2.1 GHz | 0 GHz | same as r2 |
OS Version | Linux 6.12.1-arch1-1 #1 SMP PREEMPT_DYNAMIC Fri, 22 Nov 2024 16:04:27 +0000 | Linux 6.10.10-arch1-1 #1 SMP PREEMPT_DYNAMIC Thu, 12 Sep 2024 17:21:02 +0000 | Linux 6.8.0-1030-aws #32-Ubuntu SMP Wed May 28 19:33:40 UTC 2025 | Linux 6.1.109-118.189.amzn2023.aarch64 #1 SMP Tue Sep 10 08:58:40 UTC 2024 |
Architecture used during static analysis | x86_64 | same as r0 | aarch64 | same as r2 |
Micro Architecture used during static analysis | SKYLAKE | same as r0 | ARM_NEOVERSE_V1 | ARM_NEOVERSE_V2 |
Compilation Options | kmeans-icpx-O3-aggressive: clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.2.1 (2024.2.1.20240711) --driver-mode=g++ --intel -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fiopenmp -O3 -x Host -funroll-loops -ffast-math -c -o main.o main.cpp -fveclib=SVML -faltmathlib=SVML -fheinous-gnu-extensions | kmeans-icpx-O3-aggressive: clang based Intel(R) oneAPI DPC++/C++ Compiler 2024.2.1 (2024.2.1.20240711) --driver-mode=g++ --intel -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fiopenmp -O3 -x Host -funroll-loops -ffast-math -qopt-report=5 -c -o main.o main.cpp -fveclib=SVML -faltmathlib=SVML -fheinous-gnu-extensions | kmeans-acfl-O3-all: Arm C/C++/Fortran Compiler version 24.10.1 (build number 4) (based on LLVM 19.1.0) /opt/arm/arm-linux-compiler-24.10.1_Ubuntu-20.04/llvm-bin/clang-19 --driver-mode=g++ -I . -MMD -MP -march=native -std=c++14 -g -fno-omit-frame-pointer -fopenmp -O3 -funroll-loops -ffast-math -grecord-command-line -c -o main.o main.cpp | kmeans-gcc-O3-funroll: GNU C++14 14.2.0 -mlittle-endian -mabi=lp64 -mcpu=neoverse-v2+crc+sve2-aes+sve2-sha3+nossbs -g -O3 -std=c++14 -fno-omit-frame-pointer -fopenmp -funroll-loops |
Number of processes observed | 1 | same as r0 | same as r0 | same as r0 |
Number of threads observed | 8 | same as r0 | same as r0 | same as r0 |
Frequency Driver | intel_pstate | intel_cpufreq | NA | same as r2 |
Frequency Governor | performance | same as r0 | NA | same as r2 |
Huge Pages | always | same as r0 | madvise | same as r2 |
Hyperthreading | off | same as r0 | same as r0 | same as r0 |
Number of sockets | 1 | 2 | same as r0 | same as r0 |
Number of cores per socket | 10 | 26 | 64 | 96 |
MAQAO version | 2025.1.0 | same as r0 | 2025.1.1 | same as r0 |
MAQAO build | 1cd8232d3b2009bc695f526f903b266bda9bb996::20250623-181852 | e913a471001afb562449956a906221b5bfa8ea0d::20250617-163738 | f3e40b5f1dbd62488bc0cc5f885d40677c87bfe8::20250630-094248 | same as r0 |
Comments | Intel Xeon 4210R (Cascade Lake CPU), 1-10 threads runs | Intel Xeon Platinum 8170 (Skylake CPU), 1-26 threads runs | AWS Graviton 3 (Neoverse V1) CPU, 1-64 threads runs | AWS Graviton 4 (Neoverse V2) CPU, 1-96 threads runs |