Requesting Large Pages Shared Segment Pointer = 504403158265495552 Segment Size (DW) = 268435456 (MB = 2048 ) Vector Size (DW) = 67108864 (MB = 512 ) Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 rebind: num_parthds is 8 Starting Initialization Done With Initialization a(1) 1.00000000000000000 a(N) 0.000000000000000000E+00 Base Offset = 67108864 Incremental Offset = 2048 Number of Threads = 8 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 Offset = 0 The total memory requirement is 1536 MB You are running each test 5 times The *best* time for each test is used ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds The tests below will each take a time on the order of 61124 microseconds (= 61124 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ---------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ---------------------------------------------------- Function Rate (MB/s) RMS time Min time Max time Copy: 14055.6083 .0768 .0764 .0773 Scale: 15153.9917 .0712 .0709 .0716 Add: 12883.4029 .1254 .1250 .1259 Triad: 12796.9104 .1265 .1259 .1271 Sum of a is = 101921587200000.000 Sum of b is = 20384317440000.0000 Sum of c is = 27179089920000.0000 Base Offset = 67108864 Incremental Offset = 2304 Number of Threads = 8 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 Offset = 0 The total memory requirement is 1536 MB You are running each test 5 times The *best* time for each test is used ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds The tests below will each take a time on the order of 62754 microseconds (= 62754 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ---------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ---------------------------------------------------- Function Rate (MB/s) RMS time Min time Max time Copy: 17803.0408 .0605 .0603 .0611 Scale: 16773.8389 .0641 .0640 .0642 Add: 19333.0131 .0837 .0833 .0840 Triad: 19394.8739 .0834 .0830 .0836 Sum of a is = 101921587200000.000 Sum of b is = 20384317440000.0000 Sum of c is = 27179089920000.0000 Base Offset = 67108864 Incremental Offset = 2560 Number of Threads = 8 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 Offset = 0 The total memory requirement is 1536 MB You are running each test 5 times The *best* time for each test is used ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds The tests below will each take a time on the order of 61802 microseconds (= 61802 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ---------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ---------------------------------------------------- Function Rate (MB/s) RMS time Min time Max time Copy: 13896.0102 .0777 .0773 .0783 Scale: 17970.5504 .0598 .0598 .0599 Add: 17778.7939 .0907 .0906 .0910 Triad: 18129.0953 .0891 .0888 .0894 Sum of a is = 101921587200000.000 Sum of b is = 20384317440000.0000 Sum of c is = 27179089920000.0000 Base Offset = 67108864 Incremental Offset = 2816 Number of Threads = 8 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 Offset = 0 The total memory requirement is 1536 MB You are running each test 5 times The *best* time for each test is used ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds The tests below will each take a time on the order of 63175 microseconds (= 63175 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ---------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ---------------------------------------------------- Function Rate (MB/s) RMS time Min time Max time Copy: 17762.5609 .0607 .0604 .0610 Scale: 17371.3125 .0621 .0618 .0623 Add: 19566.5475 .0826 .0823 .0828 Triad: 19741.2297 .0826 .0816 .0836 Sum of a is = 101921587200000.000 Sum of b is = 20384317440000.0000 Sum of c is = 27179089920000.0000 Base Offset = 67108864 Incremental Offset = 3072 Number of Threads = 8 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 Offset = 0 The total memory requirement is 1536 MB You are running each test 5 times The *best* time for each test is used ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds The tests below will each take a time on the order of 61253 microseconds (= 61253 clock ticks) Increase the size of the arrays if this shows that you are not getting at least 20 clock ticks per test. ---------------------------------------------------- WARNING -- The above is only a rough guideline. For best results, please be sure you know the precision of your system timer. ---------------------------------------------------- Function Rate (MB/s) RMS time Min time Max time Copy: 14127.5660 .0778 .0760 .0785 Scale: 15084.1008 .0716 .0712 .0723 Add: 13374.1816 .1210 .1204 .1218 Triad: 13257.9304 .1219 .1215 .1223 Sum of a is = 101921587200000.000 Sum of b is = 20384317440000.0000 Sum of c is = 27179089920000.0000 bindprocessor successful: thread_self() 716815 cpu_id 2 bindprocessor successful: thread_self() 675971 cpu_id 3 bindprocessor successful: thread_self() 643123 cpu_id 4 bindprocessor successful: thread_self() 700479 cpu_id 5 bindprocessor successful: thread_self() 663569 cpu_id 1 bindprocessor successful: thread_self() 372975 cpu_id 0 bindprocessor successful: thread_self() 704759 cpu_id 6 bindprocessor successful: thread_self() 593945 cpu_id 7