Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

macincloud systems reporting 100% inode use in Nagios #3373

Closed
sxa opened this issue Feb 7, 2024 · 2 comments
Closed

macincloud systems reporting 100% inode use in Nagios #3373

sxa opened this issue Feb 7, 2024 · 2 comments
Assignees
Labels
Nagios Nagios monitoring issues os:macOS

Comments

@sxa
Copy link
Member

sxa commented Feb 7, 2024

We've got a lot of these messages in the infrastructure-bot channel related to the macincloud which is concerning and likely needs remediation if it's correct for the macos file system:
HOST: test-macincloud-macos1201-x64-1 SERVICE: Disk Space Root Partition STATE: WARNING MESSAGE: DISK WARNING - free space: / 16525 MiB (13.47% inode=100%)
HOST: test-macincloud-macos1201-x64-2 SERVICE: Disk Space Root Partition STATE: WARNING MESSAGE: DISK WARNING - free space: / 23837 MiB (19.43% inode=100%)

Originally posted by @sxa in #3372 (comment)

Noting that they are currently offline while we're testing orka, but we should look at checking and resolving this, then re-enabling them.

@sxa
Copy link
Member Author

sxa commented Feb 9, 2024

@steelhead31 Can you take a look at the cause of this - it looks like there is an automounter controlling /home which may be throwing Nagios - also bear in mind that the home directories are under /Users instead of /home on macos.

map auto_home            0        0         0   100%       0         0  100%   /System/Volumes/Data/home

@steelhead31 steelhead31 self-assigned this Feb 9, 2024
@steelhead31 steelhead31 added Nagios Nagios monitoring issues os:macOS labels Feb 9, 2024
@steelhead31 steelhead31 moved this from Todo to In Progress in 2024 1Q Adoptium Plan Feb 9, 2024
@steelhead31
Copy link
Contributor

Having done some digging, the root file system for these machines is split across 2 volumes.. e.g

DISK WARNING - free space:
/ 16523 MiB (13.46% inode=100%);
| /=106156MiB;98143;110411;0;122679

As a result, the nagios check_disk plugin needs an additional parameter to calculate the combined space of both of these file systems, the --combine-thresholds parameter should be specified..

DISK OK - free space:
/ 16523 MiB (13.46% inode=100%);
| /=106156MiB;98143;110411;0;122679
 

A small update to the nagios configuration now has these filesystems alerting properly :)

We should probably look to extend the coverage to other file systems, if these continue to be required.

@github-project-automation github-project-automation bot moved this from In Progress to Done in 2024 1Q Adoptium Plan Feb 9, 2024
@sxa sxa added this to the 2024-02 (February) milestone Feb 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Nagios Nagios monitoring issues os:macOS
Projects
Status: Done
Development

No branches or pull requests

2 participants