Requesting Large Pages Shared Segment Pointer = 504403158265495552 Segment Size (DW) = 268435456 (MB = 2048 ) Vector Size (DW) = 67108864 (MB = 512 ) Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 rebind: num_parthds is 8 Starting Initialization Done With Initialization a(1) 1.00000000000000000 a(N) 0.000000000000000000E+00 Base Offset = 67108864 Incremental Offset = 384 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 11953.7194 .0906 .0898 .0917 Scale: 11863.5820 .0911 .0905 .0919 Add: 14641.4878 .1104 .1100 .1108 Triad: 14437.8072 .1119 .1116 .1121 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 512 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 12112.2137 .0888 .0886 .0890 Scale: 12044.9791 .0894 .0891 .0896 Add: 14724.4627 .1096 .1094 .1100 Triad: 14988.6830 .1080 .1075 .1084 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 640 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 11462.8953 .0942 .0937 .0945 Scale: 12041.7424 .0894 .0892 .0895 Add: 14379.1102 .1123 .1120 .1127 Triad: 14993.0908 .1080 .1074 .1087 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 768 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 12237.7764 .0880 .0877 .0883 Scale: 11354.7766 .0949 .0946 .0953 Add: 14640.4406 .1101 .1100 .1101 Triad: 14745.8364 .1095 .1092 .1096 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 896 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 11994.0387 .0898 .0895 .0902 Scale: 11299.5362 .0952 .0950 .0953 Add: 14905.3756 .1082 .1081 .1084 Triad: 12047.4548 .1340 .1337 .1341 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1024 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 12128.0170 .0888 .0885 .0890 Scale: 12039.6017 .0893 .0892 .0894 Add: 14649.2984 .1105 .1099 .1109 Triad: 14865.7364 .1085 .1083 .1086 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1152 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 11429.7597 .0944 .0939 .0948 Scale: 12132.0355 .0887 .0885 .0888 Add: 12889.4009 .1253 .1250 .1256 Triad: 15232.2915 .1065 .1057 .1074 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1280 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 12198.4472 .0882 .0880 .0884 Scale: 11764.5666 .0914 .0913 .0915 Add: 14456.0666 .1120 .1114 .1125 Triad: 14946.0038 .1078 .1078 .1078 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1408 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 12038.5557 .0897 .0892 .0900 Scale: 11819.7269 .0913 .0908 .0917 Add: 14720.6285 .1096 .1094 .1098 Triad: 14449.5114 .1118 .1115 .1120 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1536 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 12158.8940 .0885 .0883 .0886 Scale: 12091.5475 .0897 .0888 .0912 Add: 14828.1950 .1147 .1086 .1267 Triad: 15141.9843 .1072 .1064 .1087 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- bindprocessor successful: thread_self() 438339 cpu_id 7 bindprocessor successful: thread_self() 405685 cpu_id 6 bindprocessor successful: thread_self() 393283 cpu_id 2 bindprocessor successful: thread_self() 565249 cpu_id 3 bindprocessor successful: thread_self() 499901 cpu_id 0 bindprocessor successful: thread_self() 413699 cpu_id 1 bindprocessor successful: thread_self() 512073 cpu_id 4 bindprocessor successful: thread_self() 540835 cpu_id 5