Requesting Large Pages Shared Segment Pointer = 504403158265495552 Segment Size (DW) = 268435456 (MB = 2048 ) Vector Size (DW) = 67108864 (MB = 512 ) Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 Num_threads = 8 rebind: num_parthds is 8 Starting Initialization Done With Initialization a(1) 1.00000000000000000 a(N) 0.000000000000000000E+00 Base Offset = 67108864 Incremental Offset = 384 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 6149.2999 .1749 .1746 .1751 Scale: 6033.7535 .1783 .1780 .1787 Add: 7146.6613 .2260 .2254 .2265 Triad: 7260.9933 .2225 .2218 .2230 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 512 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 6143.9476 .1754 .1748 .1760 Scale: 6104.2665 .1761 .1759 .1763 Add: 7225.3855 .2239 .2229 .2246 Triad: 7264.6085 .2235 .2217 .2244 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 640 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 6017.4683 .1789 .1784 .1793 Scale: 5969.1240 .1803 .1799 .1811 Add: 7075.1577 .2282 .2276 .2288 Triad: 7030.5250 .2292 .2291 .2294 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 768 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 6133.0873 .1754 .1751 .1756 Scale: 6005.3800 .1793 .1788 .1798 Add: 7212.0175 .2242 .2233 .2248 Triad: 7193.0995 .2247 .2239 .2256 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 896 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 6225.1490 .1731 .1725 .1736 Scale: 6070.8176 .1773 .1769 .1777 Add: 7130.2209 .2260 .2259 .2261 Triad: 7228.3929 .2233 .2228 .2243 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1024 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 6222.2721 .1727 .1726 .1730 Scale: 6099.5003 .1762 .1760 .1763 Add: 7276.7925 .2225 .2213 .2236 Triad: 7345.3484 .2197 .2193 .2203 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1152 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 6022.7151 .1786 .1783 .1793 Scale: 6066.3201 .1777 .1770 .1786 Add: 7177.3105 .2251 .2244 .2264 Triad: 7225.9342 .2241 .2229 .2247 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1280 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 6126.3879 .1755 .1753 .1758 Scale: 6108.4974 .1764 .1758 .1768 Add: 7205.7018 .2244 .2235 .2258 Triad: 7209.2929 .2241 .2234 .2246 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1408 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 6148.3302 .1753 .1746 .1757 Scale: 6047.9252 .1779 .1775 .1784 Add: 7029.0363 .2309 .2291 .2321 Triad: 7136.4804 .2260 .2257 .2262 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- Base Offset = 67108864 Incremental Offset = 1536 ---------------------------------------------- Double precision appears to have 16 digits of accuracy Assuming 8 bytes per DOUBLE PRECISION word ---------------------------------------------- Array size = 67108864 The total memory requirement is 1536 MB You are running each test 5 times -- The *best* time for each test is used *EXCLUDING* the first and last iterations ---------------------------------------------------- Your clock granularity appears to be less than one microsecond Your clock granularity/precision appears to be 1 microseconds ---------------------------------------------------- Function Rate (MB/s) Avg time Min time Max time Copy: 6150.9334 .1751 .1746 .1756 Scale: 6110.3828 .1760 .1757 .1762 Add: 7284.4077 .2231 .2211 .2247 Triad: 7306.0649 .2233 .2204 .2254 ---------------------------------------------------- Solution Validates! ---------------------------------------------------- bindprocessor successful: thread_self() 581757 cpu_id 2 bindprocessor successful: thread_self() 651431 cpu_id 3 bindprocessor successful: thread_self() 446553 cpu_id 6 bindprocessor successful: thread_self() 381021 cpu_id 7 bindprocessor successful: thread_self() 630987 cpu_id 5 bindprocessor successful: thread_self() 606389 cpu_id 4 bindprocessor successful: thread_self() 634993 cpu_id 0 bindprocessor successful: thread_self() 614577 cpu_id 1