Requesting Large Pages
Shared Segment Pointer = 504403158265495552
Segment Size (DW) = 268435456 (MB = 2048 )
Vector Size (DW) = 67108864 (MB = 512 )
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
Num_threads = 8
rebind: num_parthds is 8
Starting Initialization
Done With Initialization
a(1) 1.00000000000000000
a(N) 0.000000000000000000E+00
Base Offset = 67108864
Incremental Offset = 2048
Number of Threads = 8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
Offset = 0
The total memory requirement is 1536 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 93615 microseconds
(= 93615 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 14346.1691 .0753 .0748 .0760
Scale: 14314.8193 .0752 .0750 .0754
Add: 13134.2402 .1230 .1226 .1234
Triad: 13140.8063 .1227 .1226 .1231
Sum of a is = 101921066268750.000
Sum of b is = 20384213253750.0000
Sum of c is = 27178951005000.0000
Base Offset = 67108864
Incremental Offset = 2304
Number of Threads = 8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
Offset = 0
The total memory requirement is 1536 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 93207 microseconds
(= 93207 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 16106.3155 .1397 .0667 .0760
Scale: 15722.2515 .1405 .0683 .0754
Add: 15881.8475 .1869 .1014 .1234
Triad: 15793.3853 .1871 .1020 .1231
Sum of a is = 101921066268750.000
Sum of b is = 20384213253750.0000
Sum of c is = 27178951005000.0000
Base Offset = 67108864
Incremental Offset = 2560
Number of Threads = 8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
Offset = 0
The total memory requirement is 1536 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 93483 microseconds
(= 93483 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 16106.3155 .1832 .0667 .0760
Scale: 16075.4716 .1805 .0668 .0754
Add: 15881.8475 .2202 .1014 .1234
Triad: 15793.3853 .2201 .1020 .1231
Sum of a is = 101921066268750.000
Sum of b is = 20384213253750.0000
Sum of c is = 27178951005000.0000
Base Offset = 67108864
Incremental Offset = 2816
Number of Threads = 8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
Offset = 0
The total memory requirement is 1536 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 93726 microseconds
(= 93726 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 16119.6036 .2027 .0666 .0760
Scale: 16075.4716 .2015 .0668 .0754
Add: 15881.8475 .2336 .1014 .1234
Triad: 15960.4201 .2329 .1009 .1231
Sum of a is = 101921066268750.000
Sum of b is = 20384213253750.0000
Sum of c is = 27178951005000.0000
Base Offset = 67108864
Incremental Offset = 3072
Number of Threads = 8
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
Offset = 0
The total memory requirement is 1536 MB
You are running each test 5 times
The *best* time for each test is used
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
The tests below will each take a time on the order
of 92898 microseconds
(= 92898 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
----------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
----------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 16119.6036 .2149 .0666 .0760
Scale: 16075.4716 .2142 .0668 .0754
Add: 15881.8475 .2497 .1014 .1251
Triad: 15960.4201 .2490 .1009 .1244
Sum of a is = 101921066268750.000
Sum of b is = 20384213253750.0000
Sum of c is = 27178951005000.0000
bindprocessor successful: thread_self() 43665 cpu_id 6
bindprocessor successful: thread_self() 44715 cpu_id 2
bindprocessor successful: thread_self() 44403 cpu_id 3
bindprocessor successful: thread_self() 50407 cpu_id 7
bindprocessor successful: thread_self() 63391 cpu_id 5
bindprocessor successful: thread_self() 50143 cpu_id 4
bindprocessor successful: thread_self() 54519 cpu_id 0
bindprocessor successful: thread_self() 49719 cpu_id 1