Output of stream_d

From: Thomas Dean (tomdean@ix.netcom.com)
Date: Mon Feb 15 1999 - 06:42:51 CST


My system is a DEC CELEBRIS 5133DP, 2ea, 133Mhz Pentium Processors,
running FreeBSD:

Copyright (c) 1982, 1986, 1989, 1991, 1993
        The Regents of the University of California. All rights reserved.
FreeBSD 4.0-CURRENT #4: Mon Feb 15 03:34:29 PST 1999
    tomdean@celebris:/usr/src/sys/compile/CELEBRIS-SMP
Timecounter-tdd "i8254" frequency 1193032 Hz cost 2547 ns
CPU: Pentium/P54C (586-class CPU)
  Origin = "GenuineIntel" Id = 0x525 Stepping=5
  Features=0x3bf<FPU,VME,DE,PSE,TSC,MSR,MCE,CX8,APIC>
real memory = 100663296 (98304K bytes)
avail memory = 95055872 (92828K bytes)
Programming 16 pins in IOAPIC #0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id: 0, version: 0x00030010, at 0xfee00000
 cpu1 (AP): apic id: 1, version: 0x00030010, at 0xfee00000
 io0 (APIC): apic id: 2, version: 0x000f0011, at 0xfec00000
Preloaded elf kernel "kernel" at 0xf02b0000.
Probing for devices on PCI bus 0:
chip0: <Intel 82434NX (Neptune) PCI cache memory controller> rev 0x11 on pci0.0.
0
ncr0: <ncr 53c810 fast10 scsi> rev 0x02 int a irq 11 on pci0.1.0
chip1: <Intel 82378IB PCI to ISA bridge> rev 0x88 on pci0.2.0

=============================

Here is the result of running stream_d on my system.

# time stream_d
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 1000000, Offset = 0
Total memory required = 22 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Your clock granularity appears to be less than one microsecond.
Each test below will take on the order of 273437 microseconds.
Floating exception (core dumped)
1.524u 2.264s 0:10.00 37.8% 6+30720k 0+1697io 0pf+0w
celebris: {71} !cc
cc -O2 -m486 stream_d.c second_cpu.c -o stream_d -lm
celebris: {72} time stream_d
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 1000000, Offset = 0
Total memory required = 22 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Your clock granularity/precision appears to be 7812 microseconds.
Each test below will take on the order of 273437 microseconds.
   (= 35 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function Rate (MB/s) RMS time Min time Max time
Copy: 33.5738 0.4899 0.4766 0.5000
Scale: 47.6279 0.3407 0.3359 0.3516
Add: 52.0678 0.4696 0.4609 0.4766
Triad: 51.2000 0.4836 0.4688 0.4922
19.191u 0.788s 0:19.97 100.0% 5+30244k 0+0io 0pf+0w



This archive was generated by hypermail 2b29 : Tue Apr 18 2000 - 05:23:08 CDT