deeplearning/dl-op-linalg-batch-matmul-benchmark.json

2025-10-13 07:43:23 UTC

dl-op-linalg-batch-matmul-benchmark.json

NameTime (ms)CPU (ms)Iterations
DL_OPS_BATCH_MATMUL/Scalar/iterations:13.63e+033.63e+031
DL_OPS_BATCH_MATMUL/AutoVectorization/iterations:11.01e+031.01e+031
DL_OPS_BATCH_MATMUL/Vectorization/iterations:11961961
DL_OPS_BATCH_MATMUL/Tile/iterations:11121121
DL_OPS_BATCH_MATMUL/SCF/iterations:11211211
DL_OPS_BATCH_MATMUL/BROADCAST/iterations:13673671
DL_OPS_BATCH_MATMUL/BROADCAST_OMP/iterations:111122.71
Console output
2025-09-07T12:45:54+00:00
Running ./dl-op-linalg-batch-matmul-benchmark
Run on (24 X 5100 MHz CPU s)
CPU Caches:
  L1 Data 48 KiB (x12)
  L1 Instruction 32 KiB (x12)
  L2 Unified 1280 KiB (x12)
  L3 Unified 30720 KiB (x1)
Load Average: 2.41, 3.31, 5.08
***WARNING*** CPU scaling is enabled, the benchmark real time measurements may be noisy and will incur extra overhead.
---------------------------------------------------------------------------------------------
Benchmark                                                   Time             CPU   Iterations
---------------------------------------------------------------------------------------------
DL_OPS_BATCH_MATMUL/Scalar/iterations:1                  3635 ms         3635 ms            1
DL_OPS_BATCH_MATMUL/AutoVectorization/iterations:1       1006 ms         1006 ms            1
DL_OPS_BATCH_MATMUL/Vectorization/iterations:1            196 ms          196 ms            1
DL_OPS_BATCH_MATMUL/Tile/iterations:1                     112 ms          112 ms            1
DL_OPS_BATCH_MATMUL/SCF/iterations:1                      121 ms          121 ms            1
DL_OPS_BATCH_MATMUL/BROADCAST/iterations:1                367 ms          367 ms            1
DL_OPS_BATCH_MATMUL/BROADCAST_OMP/iterations:1            111 ms         22.7 ms            1
---------- Verification ----------
Tile PASS
SCF PASS
BROADCAST PASS
BROADCAST_OMP PASS