Requesting Large Pages Shared Segment Pointer = 504403158265495552 Segment Size (DW) = 268435456 (MB = 2048 ) Vector Size (DW) = 67108864 (MB = 512 ) Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 rebind: num_parthds is 8 Starting Initialization Done With Initialization a(1) 1.00000000000000000 a(N) 0.000000000000000000E+00 Base Offset = 67108864 Incremental Offset = 384 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 11898.8760 .0909 .0902 .0912 Scale: 11752.3019 .0918 .0914 .0921 Add: 14567.2917 .1108 .1106 .1110 Triad: 14271.7847 .1131 .1129 .1132 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 512 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 12092.0182 .0896 .0888 .0903 Scale: 12037.8317 .0893 .0892 .0896 Add: 14888.8842 .1084 .1082 .1085 Triad: 15015.8695 .1074 .1073 .1075 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 640 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 11372.0802 .0947 .0944 .0950 Scale: 12021.8613 .0896 .0893 .0900 Add: 14304.6952 .1130 .1126 .1134 Triad: 14942.7144 .1080 .1078 .1082 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 768 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 12118.0311 .0888 .0886 .0891 Scale: 11190.6105 .0961 .0960 .0962 Add: 14948.2528 .1079 .1077 .1080 Triad: 14542.1578 .1108 .1108 .1109 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 896 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 11988.9141 .0901 .0896 .0905 Scale: 10960.1033 .0981 .0980 .0982 Add: 14872.4129 .1084 .1083 .1086 Triad: 11343.9844 .1421 .1420 .1422 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1024 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 12008.5102 .0894 .0894 .0895 Scale: 11976.4482 .0899 .0897 .0900 Add: 14355.9021 .1124 .1122 .1126 Triad: 14517.6720 .1110 .1109 .1111 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1152 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 11312.9903 .0953 .0949 .0955 Scale: 12096.3378 .0888 .0888 .0888 Add: 12230.5867 .1318 .1317 .1319 Triad: 15051.6683 .1077 .1070 .1084 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1280 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 12244.4808 .0879 .0877 .0881 Scale: 11761.8782 .0915 .0913 .0917 Add: 14228.3937 .1134 .1132 .1136 Triad: 15189.3598 .1063 .1060 .1065 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1408 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 11945.2697 .0905 .0899 .0915 Scale: 11739.1601 .0915 .0915 .0915 Add: 14923.6343 .1080 .1079 .1082 Triad: 14309.9676 .1127 .1126 .1129 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1536 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 12058.9604 .0892 .0890 .0894 Scale: 12071.7282 .0890 .0889 .0891 Add: 14924.6399 .1081 .1079 .1084 Triad: 15089.8456 .1070 .1067 .1071 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- bindprocessor successful: thread_self() 680095 cpu_id 6 bindprocessor successful: thread_self() 385099 cpu_id 7 bindprocessor successful: thread_self() 737411 cpu_id 5 bindprocessor successful: thread_self() 684249 cpu_id 4 bindprocessor successful: thread_self() 700651 cpu_id 1 bindprocessor successful: thread_self() 745633 cpu_id 0 bindprocessor successful: thread_self() 757765 cpu_id 2 bindprocessor successful: thread_self() 696449 cpu_id 3