Requesting Large Pages
Shared Segment Pointer = 504403158265495552
Segment Size (DW) = 268435456 (MB = 2048 )
Vector Size (DW) = 67108864 (MB = 512 )
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
rebind: num_parthds is 8
Starting Initialization
Done With Initialization
a(1) 1.00000000000000000
a(N) 0.000000000000000000E+00
Base Offset = 67108864
Incremental Offset = 2048
Number of Threads = 8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
Offset = 0
The total memory requirement is 1536 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 61124 microseconds
(= 61124 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function      Rate (MB/s)   RMS time   Min time   Max time
Copy:          14055.6083      .0768      .0764      .0773
Scale:         15153.9917      .0712      .0709      .0716
Add:           12883.4029      .1254      .1250      .1259
Triad:         12796.9104      .1265      .1259      .1271
Sum of a is = 101921587200000.000
Sum of b is = 20384317440000.0000
Sum of c is = 27179089920000.0000
Base Offset = 67108864
Incremental Offset = 2304
Number of Threads = 8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
Offset = 0
The total memory requirement is 1536 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 62754 microseconds
(= 62754 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function      Rate (MB/s)   RMS time   Min time   Max time
Copy:          17803.0408      .0605      .0603      .0611
Scale:         16773.8389      .0641      .0640      .0642
Add:           19333.0131      .0837      .0833      .0840
Triad:         19394.8739      .0834      .0830      .0836
Sum of a is = 101921587200000.000
Sum of b is = 20384317440000.0000
Sum of c is = 27179089920000.0000
Base Offset = 67108864
Incremental Offset = 2560
Number of Threads = 8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
Offset = 0
The total memory requirement is 1536 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 61802 microseconds
(= 61802 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function      Rate (MB/s)   RMS time   Min time   Max time
Copy:          13896.0102      .0777      .0773      .0783
Scale:         17970.5504      .0598      .0598      .0599
Add:           17778.7939      .0907      .0906      .0910
Triad:         18129.0953      .0891      .0888      .0894
Sum of a is = 101921587200000.000
Sum of b is = 20384317440000.0000
Sum of c is = 27179089920000.0000
Base Offset = 67108864
Incremental Offset = 2816
Number of Threads = 8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
Offset = 0
The total memory requirement is 1536 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 63175 microseconds
(= 63175 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function      Rate (MB/s)   RMS time   Min time   Max time
Copy:          17762.5609      .0607      .0604      .0610
Scale:         17371.3125      .0621      .0618      .0623
Add:           19566.5475      .0826      .0823      .0828
Triad:         19741.2297      .0826      .0816      .0836
Sum of a is = 101921587200000.000
Sum of b is = 20384317440000.0000
Sum of c is = 27179089920000.0000
Base Offset = 67108864
Incremental Offset = 3072
Number of Threads = 8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
Offset = 0
The total memory requirement is 1536 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 61253 microseconds
(= 61253 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function      Rate (MB/s)   RMS time   Min time   Max time
Copy:          14127.5660      .0778      .0760      .0785
Scale:         15084.1008      .0716      .0712      .0723
Add:           13374.1816      .1210      .1204      .1218
Triad:         13257.9304      .1219      .1215      .1223
Sum of a is = 101921587200000.000
Sum of b is = 20384317440000.0000
Sum of c is = 27179089920000.0000
bindprocessor successful: thread_self() 716815 cpu_id 2
bindprocessor successful: thread_self() 675971 cpu_id 3
bindprocessor successful: thread_self() 643123 cpu_id 4
bindprocessor successful: thread_self() 700479 cpu_id 5
bindprocessor successful: thread_self() 663569 cpu_id 1
bindprocessor successful: thread_self() 372975 cpu_id 0
bindprocessor successful: thread_self() 704759 cpu_id 6
bindprocessor successful: thread_self() 593945 cpu_id 7