Requesting Large Pages
Shared Segment Pointer = 504403158265495552
Segment Size (DW) = 268435456 (MB = 2048 )
Vector Size (DW) = 67108864 (MB = 512 )
Num_threads = 4
Num_threads = 4
Num_threads = 4
Num_threads = 4
rebind: num_parthds is 4
Starting Initialization
Done With Initialization
a(1) 1.00000000000000000
a(N) 0.000000000000000000E+00
Base Offset = 67108864
Incremental Offset = 384
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
The total memory requirement is 1536 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function   Rate (MB/s)   Avg time   Min time   Max time
Copy:        8219.5674      .1307      .1306      .1308
Scale:       7766.7820      .1384      .1382      .1385
Add:        11030.9345      .1460      .1460      .1461
Triad:      10812.6268      .1492      .1490      .1495
----------------------------------------------------
Solution Validates!
----------------------------------------------------
Base Offset = 67108864
Incremental Offset = 512
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
The total memory requirement is 1536 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function   Rate (MB/s)   Avg time   Min time   Max time
Copy:        7899.0287      .1361      .1359      .1362
Scale:       8051.5237      .1338      .1334      .1342
Add:        11069.0360      .1457      .1455      .1459
Triad:      11009.9449      .1470      .1463      .1478
----------------------------------------------------
Solution Validates!
----------------------------------------------------
Base Offset = 67108864
Incremental Offset = 640
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
The total memory requirement is 1536 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function   Rate (MB/s)   Avg time   Min time   Max time
Copy:        8243.0519      .1304      .1303      .1305
Scale:       8113.0298      .1324      .1323      .1325
Add:        10818.9127      .1491      .1489      .1494
Triad:      11183.5289      .1447      .1440      .1461
----------------------------------------------------
Solution Validates!
----------------------------------------------------
Base Offset = 67108864
Incremental Offset = 768
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
The total memory requirement is 1536 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function   Rate (MB/s)   Avg time   Min time   Max time
Copy:        8086.5170      .1328      .1328      .1329
Scale:       8204.6455      .1311      .1309      .1312
Add:        11158.9406      .1445      .1443      .1447
Triad:      11032.5108      .1462      .1460      .1464
----------------------------------------------------
Solution Validates!
----------------------------------------------------
Base Offset = 67108864
Incremental Offset = 896
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
The total memory requirement is 1536 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function   Rate (MB/s)   Avg time   Min time   Max time
Copy:        8179.0166      .1315      .1313      .1316
Scale:       7246.3272      .1488      .1482      .1492
Add:        10681.3094      .1509      .1508      .1511
Triad:       9031.0895      .1787      .1783      .1792
----------------------------------------------------
Solution Validates!
----------------------------------------------------
Base Offset = 67108864
Incremental Offset = 1024
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
The total memory requirement is 1536 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function   Rate (MB/s)   Avg time   Min time   Max time
Copy:        7909.4054      .1361      .1358      .1363
Scale:       7862.0246      .1367      .1366      .1370
Add:        10175.4341      .1583      .1583      .1584
Triad:      10171.7186      .1587      .1583      .1591
----------------------------------------------------
Solution Validates!
----------------------------------------------------
Base Offset = 67108864
Incremental Offset = 1152
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
The total memory requirement is 1536 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function   Rate (MB/s)   Avg time   Min time   Max time
Copy:        8227.2254      .1308      .1305      .1309
Scale:       8002.6150      .1378      .1342      .1448
Add:         9082.4504      .1784      .1773      .1796
Triad:      10702.2841      .1507      .1505      .1511
----------------------------------------------------
Solution Validates!
----------------------------------------------------
Base Offset = 67108864
Incremental Offset = 1280
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
The total memory requirement is 1536 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function   Rate (MB/s)   Avg time   Min time   Max time
Copy:        8108.6184      .1325      .1324      .1326
Scale:       8183.8544      .1314      .1312      .1315
Add:        11004.9408      .1464      .1464      .1464
Triad:      11172.5976      .1444      .1442      .1445
----------------------------------------------------
Solution Validates!
----------------------------------------------------
Base Offset = 67108864
Incremental Offset = 1408
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
The total memory requirement is 1536 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function   Rate (MB/s)   Avg time   Min time   Max time
Copy:        8212.7249      .1308      .1307      .1308
Scale:       7763.8163      .1383      .1383      .1384
Add:        11183.6955      .1441      .1440      .1442
Triad:      10834.5032      .1492      .1487      .1503
----------------------------------------------------
Solution Validates!
----------------------------------------------------
Base Offset = 67108864
Incremental Offset = 1536
----------------------------------------------
Double precision appears to have 16 digits of accuracy
Assuming 8 bytes per DOUBLE PRECISION word
----------------------------------------------
Array size = 67108864
The total memory requirement is 1536 MB
You are running each test 5 times
--
The *best* time for each test is used
*EXCLUDING* the first and last iterations
----------------------------------------------------
Your clock granularity appears to be less than one microsecond
Your clock granularity/precision appears to be 1 microseconds
----------------------------------------------------
Function   Rate (MB/s)   Avg time   Min time   Max time
Copy:        7884.6188      .1363      .1362      .1363
Scale:       8077.6043      .1330      .1329      .1331
Add:        11031.0065      .1462      .1460      .1465
Triad:      11006.5366      .1466      .1463      .1472
----------------------------------------------------
Solution Validates!
----------------------------------------------------
bindprocessor successful: thread_self() 1572869 cpu_id 3
bindprocessor successful: thread_self() 1433755 cpu_id 2
bindprocessor successful: thread_self() 458885 cpu_id 1
bindprocessor successful: thread_self() 1613863 cpu_id 0