
Nohang

Nohang is a highly configurable daemon for Linux that can correctly prevent out-of-memory (OOM) conditions and preserve the disk cache.

What is the problem?

The OOM killer doesn't prevent OOM conditions. OOM conditions may cause freezes, livelocks, dropped caches, and the killing of multiple processes (via SIGKILL) instead of the termination of a single process (via SIGTERM).

Here are the statements of some users:

"How do I prevent Linux from freezing when out of memory? Today I (accidentally) ran some program on my Linux box that quickly used a lot of memory. My system froze, became unresponsive and thus I was unable to kill the offender. How can I prevent this in the future? Can't it at least keep a responsive core or something running?"

(serverfault)

"With or without swap it still freezes before the OOM killer gets run automatically. This is really a kernel bug that should be fixed (i.e. run OOM killer earlier, before dropping all disk cache). Unfortunately kernel developers and a lot of other folk fail to see the problem. Common suggestions such as disable/enable swap, buy more RAM, run less processes, set limits etc. do not address the underlying problem that the kernel's low memory handling sucks camel's balls."

(serverfault)

Also see "Why are low memory conditions handled so badly?" (a discussion with 480+ posts on r/linux).

Solution

  • Use of earlyoom. This is a simple and lightweight OOM preventer written in C.
  • Use of oomd. This is a userspace OOM killer for Linux systems written in C++ and developed by Facebook.
  • Use of nohang.

Some features

  • SIGKILL and SIGTERM as signals that can be sent to the victim
  • impact on the badness of processes via matching their names with regular expressions
  • possibility of restarting processes via command like systemctl restart something if the process is selected as a victim
  • GUI notifications: results of preventing OOM and low memory warnings
  • zram support (mem_used_total as a trigger)
  • customizable intensity of monitoring
  • convenient configuration with a well-commented config file (there are 35 parameters in the config)
  • look at the config to find more
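
To illustrate the idea of influencing badness by matching process names against regular expressions, here is a minimal Python sketch. It is not nohang's actual code: the `RULES` table, the adjustment values, and the function names `adjusted_badness` and `pick_victim` are all hypothetical and chosen only for this example.

```python
import re

# Hypothetical rules: (compiled pattern, badness adjustment).
# Positive values make a process a more likely victim,
# negative values make it less likely.
RULES = [
    (re.compile(r"^(sshd|Xorg)$"), -300),
    (re.compile(r"^(chromium|firefox)"), 200),
]


def adjusted_badness(name, base_badness):
    """Shift base_badness by every rule whose pattern matches the name."""
    score = base_badness
    for pattern, delta in RULES:
        if pattern.search(name):
            score += delta
    # Clamp at zero so the score never becomes negative.
    return max(score, 0)


def pick_victim(processes):
    """Return the pid with the highest adjusted badness.

    processes is an iterable of (pid, name, base_badness) tuples.
    """
    return max(processes, key=lambda p: adjusted_badness(p[1], p[2]))[0]
```

With these example rules, a process named sshd is protected even when its base badness is high, while a browser is preferred as a victim.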

Demo

Video: nohang prevents OOM after the command while true; do tail /dev/zero; done has been executed.

Requirements

  • Linux 3.14+ and Python 3.4+ for basic use
  • libnotify (Fedora, Arch) or libnotify-bin (Debian, Ubuntu) to show GUI notifications

Memory and CPU usage

  • VmRSS is 10 to 14 MiB depending on the settings
  • CPU usage depends on the level of available memory (the frequency of memory status checks increases as the amount of available memory decreases) and monitoring intensity (can be changed by user via config)
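
One way to make the check frequency rise as available memory falls is to sleep for roughly the time it would take to exhaust the remaining memory at an assumed worst-case fill rate. The Python sketch below shows that idea only. The function name `sleep_interval`, the fill rate, and the clamping bounds are assumptions made for this example and are not taken from nohang or its config.

```python
def sleep_interval(mem_available_kib,
                   fill_rate_kib_per_s=6_000_000,
                   min_sleep=0.1,
                   max_sleep=3.0):
    """Return the number of seconds to sleep before the next memory check.

    The less memory is available, the shorter the sleep, so checks become
    more frequent (and CPU usage grows) as the system approaches OOM.
    All default values here are illustrative assumptions.
    """
    # Time needed to consume the available memory at the assumed rate.
    seconds_to_exhaustion = mem_available_kib / fill_rate_kib_per_s
    # Keep the result inside [min_sleep, max_sleep].
    return min(max(seconds_to_exhaustion, min_sleep), max_sleep)
```

A lower assumed fill rate gives longer sleeps and less CPU usage, at the cost of reacting later to a fast memory hog.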

Status

The program is unstable, and some fixes are required before the first stable version is released (documentation, translation, review and some optimisation are still needed).

Download

$ git clone https://github.com/hakavlad/nohang.git
$ cd nohang

Installation and start for systemd users

$ sudo ./install.sh

Purge

$ sudo ./purge.sh

Command line options

./nohang -h
usage: nohang [-h] [-c CONFIG]

optional arguments:
  -h, --help            show this help message and exit
  -c CONFIG, --config CONFIG
                        path to the config file, default values:
                        ./nohang.conf, /etc/nohang/nohang.conf
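
For readers who want to see how such an interface can be built, the following Python sketch reproduces the options shown above with the standard argparse module. It is not copied from nohang. The helper names `build_parser` and `resolve_config` are hypothetical, and trying the two default paths in the listed order is an assumption.

```python
import argparse
import os

# Default locations listed in the help text; the lookup order is assumed.
DEFAULT_CONFIGS = ("./nohang.conf", "/etc/nohang/nohang.conf")


def build_parser():
    """Build a parser that accepts -h/--help and -c/--config."""
    parser = argparse.ArgumentParser(prog="nohang")
    parser.add_argument(
        "-c", "--config",
        metavar="CONFIG",
        help="path to the config file, default values: "
             "./nohang.conf, /etc/nohang/nohang.conf",
    )
    return parser


def resolve_config(cli_value, exists=os.path.exists):
    """Return the config path to use, or None if no config is found."""
    # An explicit -c value always wins.
    if cli_value is not None:
        return cli_value
    # Otherwise fall back to the first default path that exists.
    for path in DEFAULT_CONFIGS:
        if exists(path):
            return path
    return None
```

Usage: `resolve_config(build_parser().parse_args().config)` returns the path given with -c, or the first default path that exists.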

How to configure nohang

The program can be configured by editing the config file. The configuration includes the following sections:

  1. Memory levels to respond to as an OOM threat
  2. The frequency of checking the level of available memory (and CPU usage)
  3. The prevention of killing innocent victims
  4. Impact on the badness of processes via matching their names with regular expressions
  5. The execution of a specific command instead of sending the SIGTERM signal
  6. GUI notifications:
    • results of preventing OOM
    • low memory warnings
  7. Preventing the slowing down of the program
  8. Output verbosity

Just read the description of the parameters and edit the values. Please restart nohang to apply changes. The default path to the config after installing via ./install.sh is /etc/nohang/nohang.conf.
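
As an illustration of what "memory levels to respond to" can mean in practice, the Python sketch below parses the text of /proc/meminfo and picks a signal from two thresholds. It is a simplified example, not nohang's implementation: the names `parse_meminfo`, `choose_signal`, `sigterm_percent` and `sigkill_percent` and the default percentages are hypothetical, and swap and zram are ignored. It relies on the MemAvailable field, which the kernel provides since Linux 3.14.

```python
import signal


def parse_meminfo(text):
    """Parse the text of /proc/meminfo into a dict of integer values (KiB)."""
    values = {}
    for line in text.splitlines():
        key, _, rest = line.partition(":")
        fields = rest.split()
        if fields:
            values[key.strip()] = int(fields[0])
    return values


def choose_signal(meminfo, sigterm_percent=10.0, sigkill_percent=5.0):
    """Return SIGKILL, SIGTERM or None depending on available memory.

    The thresholds are hypothetical percentages of MemTotal.
    """
    available = 100.0 * meminfo["MemAvailable"] / meminfo["MemTotal"]
    if available <= sigkill_percent:
        # Critically low: the victim gets no chance to exit gracefully.
        return signal.SIGKILL
    if available <= sigterm_percent:
        # Low: ask the victim to terminate cleanly.
        return signal.SIGTERM
    return None
```

On a real system the input would come from `open("/proc/meminfo").read()`. Using a softer threshold for SIGTERM and a harder one for SIGKILL gives the victim a chance to exit cleanly before it is killed.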

Feedback

Please create issues. Use cases, feature requests and any questions are welcome.