OV - K-Means scalability acfl-O3-funroll 100000000

********************************************************************************
MAQAO 2025.1.1 - f3e40b5f1dbd62488bc0cc5f885d40677c87bfe8::20250630-094248 || 2025/06/30
/home/fmusial/MAQAO/bin/maqao oneview --create-report=one --with-FLOPS --replace --run-directory=/home/fmusial/KMEANS_Benchmarks --executable=kmeans/kmeans-acfl-O3-funroll "--experiment-name=K-Means scalability acfl-O3-funroll 100000000" "--run-command= input/100000000.in 1000 100000000 50 25" -c=/home/fmusial/KMEANS_Benchmarks/kmeans_multiruns_conf_neoverse_v1.json -WS -dbg=1 -xp=/home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000 
CPY:  [true] ./kmeans/kmeans-acfl-O3-funroll --> /home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/binaries/kmeans-acfl-O3-funroll
CMD:  OMP_PROC_BIND=true  OMP_NUM_THREADS=1   /home/fmusial/MAQAO/bin/maqao lprof _caller=oneview  --xp="/home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/tools/lprof_npsu_run_0" --mpi-command="" --collect-CPU-time-intervals -p=NEON_SVE_FLOP  --collect-topology tpp=1   -- /home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/binaries/kmeans-acfl-O3-funroll input/100000000.in 1000 100000000 50 25
CMD:  OMP_PROC_BIND=true  OMP_NUM_THREADS=2   /home/fmusial/MAQAO/bin/maqao lprof _caller=oneview  --xp="/home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/tools/lprof_npsu_run_1" --mpi-command="" --collect-CPU-time-intervals -p=NEON_SVE_FLOP  --collect-topology tpp=2   -- /home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/binaries/kmeans-acfl-O3-funroll input/100000000.in 1000 100000000 50 25
CMD:  OMP_PROC_BIND=true  OMP_NUM_THREADS=4   /home/fmusial/MAQAO/bin/maqao lprof _caller=oneview  --xp="/home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/tools/lprof_npsu_run_2" --mpi-command="" --collect-CPU-time-intervals -p=NEON_SVE_FLOP  --collect-topology tpp=4   -- /home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/binaries/kmeans-acfl-O3-funroll input/100000000.in 1000 100000000 50 25
CMD:  OMP_PROC_BIND=true  OMP_NUM_THREADS=8   /home/fmusial/MAQAO/bin/maqao lprof _caller=oneview  --xp="/home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/tools/lprof_npsu_run_3" --mpi-command="" --collect-CPU-time-intervals -p=NEON_SVE_FLOP  --collect-topology tpp=8   -- /home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/binaries/kmeans-acfl-O3-funroll input/100000000.in 1000 100000000 50 25
CMD:  OMP_PROC_BIND=true  OMP_NUM_THREADS=16   /home/fmusial/MAQAO/bin/maqao lprof _caller=oneview  --xp="/home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/tools/lprof_npsu_run_4" --mpi-command="" --collect-CPU-time-intervals -p=NEON_SVE_FLOP  --collect-topology tpp=16   -- /home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/binaries/kmeans-acfl-O3-funroll input/100000000.in 1000 100000000 50 25
CMD:  OMP_PROC_BIND=true  OMP_NUM_THREADS=32   /home/fmusial/MAQAO/bin/maqao lprof _caller=oneview  --xp="/home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/tools/lprof_npsu_run_5" --mpi-command="" --collect-CPU-time-intervals -p=NEON_SVE_FLOP  --collect-topology tpp=32   -- /home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/binaries/kmeans-acfl-O3-funroll input/100000000.in 1000 100000000 50 25
CMD:  OMP_PROC_BIND=true  OMP_NUM_THREADS=48   /home/fmusial/MAQAO/bin/maqao lprof _caller=oneview  --xp="/home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/tools/lprof_npsu_run_6" --mpi-command="" --collect-CPU-time-intervals -p=NEON_SVE_FLOP  --collect-topology tpp=48   -- /home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/binaries/kmeans-acfl-O3-funroll input/100000000.in 1000 100000000 50 25
CMD:  OMP_PROC_BIND=true  OMP_NUM_THREADS=64   /home/fmusial/MAQAO/bin/maqao lprof _caller=oneview  --xp="/home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/tools/lprof_npsu_run_7" --mpi-command="" --collect-CPU-time-intervals -p=NEON_SVE_FLOP  --collect-topology tpp=64   -- /home/fmusial/KMEANS_Benchmarks/results/scalability/acfl/acfl-O3-funroll/points_100000000/binaries/kmeans-acfl-O3-funroll input/100000000.in 1000 100000000 50 25
In run run_1_thread, 1 loops were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.938167989254% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
1 functions were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
6 functions were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.28056421596557% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
1 functions were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.0036227945238352% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
In run run_8_threads, 1 loops were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.90932697057724% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
In run run_16_threads, 1 loops were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.6988839507103% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
1 functions were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.0027088522911072% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
In run run_32_threads, 1 loops were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.56407618522644% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
2 functions were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.004117344506085% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
In run run_48_threads, 1 loops were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.3958246409893% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
1 functions were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.0016701462445781% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
In run run_64_threads, 1 loops were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.3276549577713% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
3 functions were discarded from static analysis because their coverage
are lower than object_coverage_threshold value (0.01%).
That represents 0.0042552592931316% of the execution time. To include them, change the value
in the experiment directory configuration file, then rerun the command with the additionnal parameter
--force-static-analysis
Report Configuration