Requesting Large Pages Shared Segment Pointer = 504403158265495552 Segment Size (DW) = 268435456 (MB = 2048 ) Vector Size (DW) = 67108864 (MB = 512 ) Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 rebind: num_parthds is 8 Starting Initialization Done With Initialization a(1) 1.00000000000000000 a(N) 0.000000000000000000E+00 Base Offset = 67108864 Incremental Offset = 384 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 10756.1876 .1004 .0998 .1011 Scale: 10296.7898 .1048 .1043 .1053 Add: 11089.5143 .1456 .1452 .1461 Triad: 10684.2826 .1510 .1507 .1512 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 512 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 10390.6283 .1034 .1033 .1035 Scale: 10633.8233 .1011 .1010 .1011 Add: 11236.3952 .1436 .1433 .1439 Triad: 11277.8142 .1429 .1428 .1431 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 640 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 10279.2574 .1050 .1045 .1053 Scale: 10520.5363 .1025 .1021 .1028 Add: 10827.5397 .1490 .1488 .1494 Triad: 11325.5936 .1427 .1422 .1430 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 768 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 10574.3123 .1020 .1015 .1024 Scale: 9975.4460 .1133 .1076 .1234 Add: 11141.0616 .1450 .1446 .1453 Triad: 10591.4384 .1526 .1521 .1535 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 896 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 10660.3662 .1013 .1007 .1019 Scale: 7701.3703 .1450 .1394 .1547 Add: 11172.0803 .1443 .1442 .1444 Triad: 8228.7938 .1959 .1957 .1962 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1024 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 10416.9212 .1034 .1031 .1038 Scale: 10495.1001 .1024 .1023 .1025 Add: 10450.8440 .1545 .1541 .1550 Triad: 10453.4315 .1546 .1541 .1549 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1152 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 10321.1899 .1044 .1040 .1050 Scale: 10329.7597 .1040 .1039 .1042 Add: 8674.0799 .1870 .1857 .1881 Triad: 11251.9572 .1435 .1431 .1441 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1280 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 10659.2308 .1012 .1007 .1017 Scale: 10714.2721 .1004 .1002 .1007 Add: 10663.9170 .1511 .1510 .1512 Triad: 11156.1487 .1445 .1444 .1447 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1408 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 10728.2076 .1005 .1001 .1007 Scale: 10300.5343 .1048 .1042 .1053 Add: 11245.3828 .1435 .1432 .1438 Triad: 10689.1515 .1507 .1507 .1508 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1536 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 10396.7090 .1034 .1033 .1036 Scale: 10579.7277 .1016 .1015 .1017 Add: 11238.7038 .1440 .1433 .1448 Triad: 11274.8496 .1431 .1429 .1434 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- bindprocessor successful: thread_self() 13085 cpu_id 0 bindprocessor successful: thread_self() 44137 cpu_id 1 bindprocessor successful: thread_self() 49305 cpu_id 2 bindprocessor successful: thread_self() 45697 cpu_id 3 bindprocessor successful: thread_self() 18737 cpu_id 6 bindprocessor successful: thread_self() 59663 cpu_id 7 bindprocessor successful: thread_self() 53571 cpu_id 5 bindprocessor successful: thread_self() 50339 cpu_id 4