User Tools

Site Tools


new_server

Old Hardware

  • AMD Athlon 64 X2 6000+ 3.0GHz 125W
  • 8GB DDR3

New Hardware

  • AMD Ryzen 1600 3.2GHz 65W
  • 32GB DDR4 2400
  • APC BN700MC Back-UPS 700 VA (420W)
  • ATI Radeon X600 ($4.99 cheap PCIe graphics card)

Configurations

Ubuntu Server 17.04 (US)

  • Kernel version 4.10
  • gcc version 6.3.0

Fedora Server 26 (FS)

  • Kernel version 4.11.9
  • gcc version 7.1.1

Benchmarks

Test Old New (US) New (FS)
Idle Power 65W 56W (w/ video card)
Full Load Power 187W 132W
dd if=/dev/urandom of=outfile bs=1000000 count=1000 1m1.045s (16.4 MB/s) 14.425s (69.3 MB/s) 14.543s (68.8 MB/s)
dd if=outfile of=/dev/null bs=1000000
(after reboot to clear disk cache)
6.556s (153 MB/s) 1.058s (947 MB/s) 0.944s (1.1 GB/s)
Phoronix ramspeed (Average, Integer) 2988.37 MB/s won't compile won't compile
Phoronix ramspeed (Average, Floating Point) 3149.90 MB/s won't compile won't compile
Phoronix aio-stress 682.69 MB/s 2713.93 MB/s 2675.22 MB/s
Phoronix n-queens 136.92s 24.81s 22.86s

Bonnie++ in a VM

Command:

bonnie++ -s 3g

Executed on a VM with 768MB of RAM.

Old Hardware

ide interface

Version  1.97       ------Sequential Output------ --Sequential Input- --Random-
Concurrency   1     -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Machine        Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP  /sec %CP
hathor           3G   748  98 85183  23 63873  27  2543  96 295344  57  1971 195
Latency             23852us    9710ms    6711ms    9268us   10251us   43389us
Version  1.97       ------Sequential Create------ --------Random Create--------
hathor              -Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
              files  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP
                 16 +++++ +++ +++++ +++ +++++ +++ +++++ +++ +++++ +++ +++++ +++
Latency             15314us     591us     778us    3577us     913us     217us
1.97,1.97,hathor,1,1499995713,3G,,748,98,85183,23,63873,27,2543,96,295344,57,1971,195,16,,,,,+++++,+++,+++++,+++,+++++,+++,+++++,+++,+++++,+++,+++++,+++,23852us,9710ms,6711ms,9268us,10251us,43389us,15314us,591us,778us,3577us,913us,217us

virtio interface

Version  1.97       ------Sequential Output------ --Sequential Input- --Random-
Concurrency   1     -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Machine        Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP  /sec %CP
hathor           3G   752  97 100045  19 97967  23  2584  98 737193  70  3204  57
Latency             30917us     656ms     235ms    9763us    4836us    7188us
Version  1.97       ------Sequential Create------ --------Random Create--------
hathor              -Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
              files  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP
                 16 +++++ +++ +++++ +++ +++++ +++ +++++ +++ +++++ +++ +++++ +++
Latency              9398us     595us     647us    1329us      48us     226us
1.97,1.97,hathor,1,1499994609,3G,,752,97,100045,19,97967,23,2584,98,737193,70,3204,57,16,,,,,+++++,+++,+++++,+++,+++++,+++,+++++,+++,+++++,+++,+++++,+++,30917us,656ms,235ms,9763us,4836us,7188us,9398us,595us,647us,1329us,48us,226us

New Hardware (FS)

virtio interface

Version  1.97       ------Sequential Output------ --Sequential Input- --Random-
Concurrency   1     -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Machine        Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP  /sec %CP
hathor           3G  1591  99 195864   9 192590  10 +++++ +++ 2287940  56 +++++ +++
Latency             13792us    3537us    3623us    3168us    1356us    1846us
Version  1.97       ------Sequential Create------ --------Random Create--------
hathor              -Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
              files  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP  /sec %CP
                 16 +++++ +++ +++++ +++ +++++ +++ +++++ +++ +++++ +++ +++++ +++
Latency              2731us     267us     257us     122us      25us     122us
1.97,1.97,hathor,1,1500205275,3G,,1591,99,195864,9,192590,10,+++++,+++,2287940,56,+++++,+++,16,,,,,+++++,+++,+++++,+++,+++++,+++,+++++,+++,+++++,+++,+++++,+++,13792us,3537us,3623us,3168us,1356us,1846us,2731us,267us,257us,122us,25us,122us

Installation

  • Install base F26 server.
  • dnf update
  • dnf install \
      vim \
      qemu-system-x86 \
      qemu-kvm \
      apcupsd \
      nfs-utils \
      libvirt-daemon-qemu \
      libvirt-client \
      ssmtp
  • Edit /etc/default/grub:
    • Remove rhgb
    • Decrease timeout
  • grub2-mkconfig -o /boot/efi/EFI/fedora/grub.cfg
  • Add /etc/sysconfig/network-scripts/ifcfg-br0:
    DEVICE=br0
    TYPE=Bridge
    DELAY=0
    NM_CONTROLLED=no
    ONBOOT=yes
    BOOTPROTO=static
    IPADDR=192.168.16.20
    NETMASK=255.255.255.0
    GATEWAY=192.168.16.1
  • In /etc/sysconfig/network-scripts/ifcfg-enp7s0, change BOOTPROTO to none, and add BRIDGE=br0 and NM_CONTROLLED=no
  • chkconfig network on
  • edit /etc/apcupsd/apcupsd.conf and /etc/apcupsd/apccontrol
  • systemctl enable apcupsd.service
  • systemctl enable libvirt-guests
  • systemctl enable nfs-server.service
  • systemctl enable rpcbind
  • firewall-cmd –add-service=nfs –permanent
  • Do fixes in https://bugzilla.redhat.com/show_bug.cgi?id=1472062 (disable SELinux and reduce stop timeout of apcupsd service to 10s) to allow apcupsd to work properly
  • Configure ssmtp in /etc/ssmtp/ssmtp.conf

Services

Server Services OS Hardware
anubis (host)
  • apcupsd
  • NFS
  • KVM (libvirt-guests)
  • gmail backups
  • NFS backups
Fedora Server 26 32GB RAM; 12 CPU threads
oneill (VM)
  • HTTP
    • wiki
    • tt-rss
    • mythweb (proxy)
    • cameras (proxy)
    • cgit (proxy)
Ubuntu Server 16.04 2GB RAM; 1 VCPU
carter (VM)
  • git-daemon
  • gitolite git hosting over ssh
  • cgit
Ubuntu Server 16.04 2GB RAM; 1 VCPU
baal (VM)
  • openvpn
  • squid proxy
Fedora Server 26 1GB RAM; 1 VCPU
hathor (VM)
  • minetest
Ubuntu Server 16.04 2GB RAM; 1 VCPU
ra (VM)
  • mythtv backend
  • mythweb
Mythbuntu 16.04 4GB RAM; 2 VCPUs

Issues

APC UPS not killing power

I think this is a Fedora / systemd problem. apcupsd will detect the AC power failure and initiate a shutdown (shutdown -h -H now via apccontrol). But apccontrol killpower does not seem to be called. Also, during shutdown, systemd reports “A stop job is running for APC UPS Power Control Daemon for Linux”. Not sure if this is because the apcupsd service was what kicked off the shutdown command, so it wasn't able to be stopped.

Bug report filed: https://bugzilla.redhat.com/show_bug.cgi?id=1472062 Problem of not cutting UPS power seems to be due to /etc/apcupsd/powerfail file not getting created which might be because of SELinux. Other part of the problem (hanging on stopping the apcupsd service) still remains but is worked around by changing the service timeout to 10s.

M.2 PCIe NVMe drive disappearing

Three times now my server has stopped responding and shows a blank console. After resetting and going into the UEFI setup the Western Digital M.2 drive no longer shows up. After a poweroff and cold boot, the drive reappears in UEFI setup and I have to reset my boot settings to select it as the default boot entry. The system appears to run properly after this until the next time.

The problem I'm observing seems very similar to this: https://superuser.com/questions/1194478/ssd-suddenly-becomes-unreadable-how-to-diagnose

After the third time this happened, on 2017-12-13, I updated the UEFI firmware on the ASRock motherboard to version 3.30. The upgrade was successful, and after resetting my options (particularly re-enabling SVM and power on after AC loss), the system appears to be running properly again now.

On 2017-12-15 the system froze again. I disabled “C6 Mode” in the UEFI setup and started up again.

2017-12-17: I have not observed the problem since disabling “C6 Mode”, but I have a feeling it will still come back and could be related to NVMe APST modes. Similar problem here: https://bbs.archlinux.org/viewtopic.php?id=232692. I added kernel parameter:

nvme_core.default_ps_max_latency_us=0

Now nvme get-feature -f 0x0c -H /dev/nvme0n1 shows APST is disabled.

Turned “C6 Mode” back on but have not observed the M.2 drive disappearing problem again with APST disabled.

BUG: soft lockup

A few times after fixing the M.2 drive disappearing issue, my server has frozen. One of the times I caught the console output which listed several messages like “watchdog: BUG: soft lockup - CPU#10 stuck for 22s! [worker:14788]”, and one “INFO: rcu_sched detected stalls on CPUs/tasks:” … “rcu_sched kthread starved for 19150377 jiffies!”. I'm not sure what is causing this.

On 2018-01-08, I upgraded from kernel 4.14.4 to 4.14.11 and added “consoleblank=0” to the kernel command line so if this happens again hopefully I will not lose console output.

2018-01-09: Igor seems to be experiencing the same bug and pointed me to https://bugzilla.kernel.org/show_bug.cgi?id=196683. I will probably disable C6 in UEFI setup again.

2018-01-15: Got the freeze again after disabling C6 mode in setup. I looked deeper in setup options and found the buried “Global C-State Control” option that Igor and Mike had disabled so I disabled that as well. No freezes since then.

new_server.txt · Last modified: by josh