perf-bench — General framework for benchmark suites
Synopsis
perf bench [<common options>] <subsystem> <suite> [<options>]
Description
This perf bench command is a general framework for benchmark suites.
Common Options
- -r, --repeat=
- Specify amount of times to repeat the run (default 10). 
- -f, --format=
- Specify format style. Current available format styles are: 
- default
- Default style. This is mainly for human reading. 
.ft C
% perf bench sched pipe                      # with no style specified
(executing 1000000 pipe operations between two tasks)
        Total time:5.855 sec
                5.855061 usecs/op
                170792 ops/sec
.ft
- simple
- This simple style is friendly for automated processing by scripts. 
.ft C % perf bench --format=simple sched pipe # specified simple 5.988 .ft
Subsystem
- sched
- Scheduler and IPC mechanisms. 
- mem
- Memory access performance. 
- numa
- NUMA scheduling and MM benchmarks. 
- futex
- Futex stressing benchmarks. 
- epoll
- Eventpoll (epoll) stressing benchmarks. 
- all
- All benchmark subsystems. 
SUITES FOR sched
- messaging
- Suite for evaluating performance of scheduler and IPC mechanisms. Based on hackbench by Rusty Russell. 
Options of messaging
- -p, --pipe
- Use pipe() instead of socketpair() 
- -t, --thread
- Be multi thread instead of multi process 
- -g, --group=
- Specify number of groups 
- -l, --nr_loops=
- Specify number of loops 
Example of messaging
.ft C
% perf bench sched messaging                 # run with default
options (20 sender and receiver processes per group)
(10 groups == 400 processes run)
      Total time:0.308 sec
% perf bench sched messaging -t -g 20        # be multi-thread, with 20 groups
(20 sender and receiver threads per group)
(20 groups == 800 threads run)
      Total time:0.582 sec
.ft
- pipe
- Suite for pipe() system call. Based on pipe-test-1m.c by Ingo Molnar. 
Options of pipe
- -l, --loop=
- Specify number of loops. 
Example of pipe
.ft C
% perf bench sched pipe
(executing 1000000 pipe operations between two tasks)
        Total time:8.091 sec
                8.091833 usecs/op
                123581 ops/sec
% perf bench sched pipe -l 1000              # loop 1000
(executing 1000 pipe operations between two tasks)
        Total time:0.016 sec
                16.948000 usecs/op
                59004 ops/sec
.ftSUITES FOR mem
- memcpy
- Suite for evaluating performance of simple memory copy in various ways. 
Options of memcpy
- -l, --size
- Specify size of memory to copy (default: 1MB). Available units are B, KB, MB, GB and TB (case insensitive). 
- -f, --function
- Specify function to copy (default: default). Available functions are depend on the architecture. On x86-64, x86-64-unrolled, x86-64-movsq and x86-64-movsb are supported. 
- -l, --nr_loops
- Repeat memcpy invocation this number of times. 
- -c, --cycles
- Use perf’s cpu-cycles event instead of gettimeofday syscall. 
- memset
- Suite for evaluating performance of simple memory set in various ways. 
Options of memset
- -l, --size
- Specify size of memory to set (default: 1MB). Available units are B, KB, MB, GB and TB (case insensitive). 
- -f, --function
- Specify function to set (default: default). Available functions are depend on the architecture. On x86-64, x86-64-unrolled, x86-64-stosq and x86-64-stosb are supported. 
- -l, --nr_loops
- Repeat memset invocation this number of times. 
- -c, --cycles
- Use perf’s cpu-cycles event instead of gettimeofday syscall. 
SUITES FOR numa
- mem
- Suite for evaluating NUMA workloads. 
SUITES FOR futex
- hash
- Suite for evaluating hash tables. 
- wake
- Suite for evaluating wake calls. 
- wake-parallel
- Suite for evaluating parallel wake calls. 
- requeue
- Suite for evaluating requeue calls. 
- lock-pi
- Suite for evaluating futex lock_pi calls. 
SUITES FOR epoll
- wait
- Suite for evaluating concurrent epoll_wait calls. 
- ctl
- Suite for evaluating multiple epoll_ctl calls.