deeplearning/dl-op-linalg-matmul-benchmark.json

2025-10-13 07:43:23 UTC

dl-op-linalg-matmul-benchmark.json

NameTime (ms)CPU (ms)Iterations
DL_OPS_MATMUL/scalar_O0/iterations:14.1e+034.1e+031
DL_OPS_MATMUL/scalar_O3/iterations:13.58e+033.58e+031
DL_OPS_MATMUL/tile/iterations:11081081
DL_OPS_MATMUL/vec/iterations:161.461.41
DL_OPS_MATMUL/vec_omp/iterations:118.57.881
Console output
2025-09-07T12:45:36+00:00
Running ./dl-op-linalg-matmul-benchmark
Run on (24 X 5100 MHz CPU s)
CPU Caches:
  L1 Data 48 KiB (x12)
  L1 Instruction 32 KiB (x12)
  L2 Unified 1280 KiB (x12)
  L3 Unified 30720 KiB (x1)
Load Average: 2.48, 3.39, 5.15
***WARNING*** CPU scaling is enabled, the benchmark real time measurements may be noisy and will incur extra overhead.
-------------------------------------------------------------------------------
Benchmark                                     Time             CPU   Iterations
-------------------------------------------------------------------------------
DL_OPS_MATMUL/scalar_O0/iterations:1       4100 ms         4100 ms            1
DL_OPS_MATMUL/scalar_O3/iterations:1       3583 ms         3583 ms            1
DL_OPS_MATMUL/tile/iterations:1             108 ms          108 ms            1
DL_OPS_MATMUL/vec/iterations:1             61.4 ms         61.4 ms            1
DL_OPS_MATMUL/vec_omp/iterations:1         18.5 ms         7.88 ms            1
---------- Verification ----------
tile PASS
vec PASS
vec_omp PASS