Troubleshooting 'su' Authentication: The PAM system-auth Pitfall

Situation

A common issue when integrating Linux servers with Active Directory (AD) is that users cannot use the su command to switch to another account, even when they enter the correct password. They typically see nothing more than a generic “Permission denied” error.

This usually isn’t a password issue, but a configuration conflict in the Pluggable Authentication Modules (PAM) stack.

The Problem: Broken system-auth

On many RHEL-based systems, the /etc/pam.d/su configuration references a global system-auth file.

The issue often arises when system-auth includes modules that do not belong on a server, such as pam_fprintd.so (fingerprint authentication). If the fingerprint service is failing or the hardware is missing, the authentication chain can fail before it ever checks the user’s password. Whether it does depends on the module’s control flag: a failure under requisite aborts the stack immediately, while a failure under sufficient is ignored and evaluation continues.
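To see whether this applies to a given host, a quick check along these lines can help. It is a minimal sketch rather than a vendor tool: has_fprintd is a hypothetical helper name, and the path assumes the RHEL-style /etc/pam.d/system-auth discussed above.

```shell
# has_fprintd FILE
# Hypothetical helper: succeeds if FILE has an active (uncommented)
# line that references the fingerprint module pam_fprintd.so.
has_fprintd() {
    awk '!/^[ \t]*#/ && /pam_fprintd\.so/ { found = 1 } END { exit !found }' "$1"
}

# Assumed RHEL-style path; adjust for your distribution.
pam_file=/etc/pam.d/system-auth

if [ -r "$pam_file" ]; then
    if has_fprintd "$pam_file"; then
        echo "$pam_file references pam_fprintd.so:"
        grep -n 'pam_fprintd\.so' "$pam_file"
    else
        echo "$pam_file does not reference pam_fprintd.so"
    fi
fi
```

If the module shows up, the control flag in the second column of that line indicates how a failure is treated.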

The Fix: Switching to password-auth

The most reliable fix is to point the su service at the password-auth stack, which is the same stack SSH uses on these systems. If SSH logins already work for your AD users, reusing this stack gives su the same known-good behavior.
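Before changing anything, it can be worth confirming which shared stacks su and sshd pull in on your host. The sketch below assumes the RHEL-style files /etc/pam.d/su and /etc/pam.d/sshd; referenced_stacks is a hypothetical helper name.

```shell
# referenced_stacks FILE
# Hypothetical helper: print the unique stack names that FILE pulls in
# through "include" or "substack" lines, ignoring comment lines.
referenced_stacks() {
    awk '$1 !~ /^#/ && ($2 == "include" || $2 == "substack") { print $3 }' "$1" | sort -u
}

# Assumed RHEL-style paths; adjust for your distribution.
for f in /etc/pam.d/su /etc/pam.d/sshd; do
    if [ -r "$f" ]; then
        echo "$f: $(referenced_stacks "$f" | tr '\n' ' ')"
    fi
done
```

If su lists system-auth while sshd lists password-auth, the two services are authenticating through different stacks.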

Step 1: Modify PAM configuration

Edit /etc/pam.d/su and replace the references to system-auth with password-auth.

# /etc/pam.d/su
auth        substack      password-auth
account     substack      password-auth
session     substack      password-auth
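The edit in Step 1 can also be scripted. The following is a minimal sketch under a few assumptions: switch_su_stack is a hypothetical helper, the file uses whitespace-separated fields as in the snippet above, and your sed accepts -E and -i with a backup suffix (GNU sed on RHEL does). It changes only the stack name, so existing include or substack keywords stay as they are.

```shell
# switch_su_stack FILE
# Hypothetical helper: on every uncommented line of FILE, replace the
# stack name system-auth with password-auth. The original is kept as
# FILE.bak so the change can be rolled back.
switch_su_stack() {
    sed -i.bak -E \
        '/^[[:space:]]*#/!s/([[:space:]])system-auth([[:space:]]|$)/\1password-auth\2/' \
        "$1"
}

# Example (run as root, and keep a second root shell open until a test
# "su" succeeds, so a mistake cannot lock you out):
#   switch_su_stack /etc/pam.d/su
#   diff /etc/pam.d/su.bak /etc/pam.d/su
```

Reviewing the diff before closing the spare root shell is the cheap safeguard here, since a broken PAM file affects every later su attempt.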

Step 2: Restrict ‘su’ to the Wheel Group

To limit su to authorized administrators, make sure the pam_wheel.so module is enabled in /etc/pam.d/su so that it checks membership of the local wheel group.

auth           required        pam_wheel.so use_uid
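pam_wheel.so only helps if the right accounts are in wheel, so it is worth checking membership before enabling the line. A minimal sketch follows; groups_contain and user_in_wheel are hypothetical helper names, and alice is a placeholder account.

```shell
# groups_contain "LIST" GROUP
# Hypothetical helper: succeeds if the space-separated LIST contains
# GROUP as an exact entry.
groups_contain() {
    printf '%s\n' "$1" | tr ' ' '\n' | grep -qxF -- "$2"
}

# user_in_wheel USER
# Succeeds if "id -nG" lists wheel among USER's groups.
user_in_wheel() {
    groups_contain "$(id -nG "$1")" wheel
}

# Example (alice is a placeholder account; usermod needs root):
#   user_in_wheel alice || usermod -aG wheel alice
```

The exact-match test matters: a plain substring search would wrongly accept a group such as wheelers.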

SSSD Access Provider Issues

Sometimes the password is correct and the PAM configuration is clean, but SSSD (System Security Services Daemon) still blocks the switch because of AD-specific access rules.

If the user is valid but SSSD denies the su service on this host, check /etc/sssd/sssd.conf. When a complex GPO or access control setup is the suspect, you can temporarily change the access provider to simple to isolate the problem:

# /etc/sssd/sssd.conf
[domain/yourdomain]
# Change from 'ad' to 'simple' for testing
access_provider = simple
simple_allow_groups = wheel, linux_admins_ad
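SSSD only picks up this change after a restart, so it helps to read the value back and then restart the service. The sketch below is illustrative: get_access_provider is a hypothetical helper, and someuser is a placeholder AD account.

```shell
# get_access_provider FILE
# Hypothetical helper: print the value of every uncommented
# access_provider line in FILE (one line per domain section that sets it).
get_access_provider() {
    awk -F= '/^[ \t]*access_provider[ \t]*=/ { gsub(/[ \t]/, "", $2); print $2 }' "$1"
}

# Example (as root; sssd.conf is normally readable by root only):
#   get_access_provider /etc/sssd/sssd.conf
#   sssctl config-check      # if the sssd-tools package is installed
#   systemctl restart sssd
#   su - someuser            # someuser is a placeholder AD account
```

Because this is a test setting, remember to restore the original access_provider once the cause has been isolated.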

Summary

When su fails on an AD-joined Linux server:

Check the logs: journalctl -u sssd and /var/log/secure.
Look for “broken” modules in system-auth (like fingerprints).
Align your su configuration with the working password-auth stack.
Verify that SSSD isn’t over-filtering access via its access_provider.
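For the first item in the checklist, PAM log entries carry the service name in parentheses, for example pam_unix(su:auth) or pam_sss(su-l:auth), which makes them easy to filter. A minimal sketch, assuming the RHEL log path /var/log/secure; su_pam_events is a hypothetical helper name.

```shell
# su_pam_events FILE
# Hypothetical helper: print the PAM log lines in FILE that belong to
# the su or su-l service, e.g. "pam_unix(su:auth): ...".
su_pam_events() {
    grep -E 'pam_[a-z0-9_]+\((su|su-l):' "$1"
}

# Assumed RHEL log path; reading it normally requires root.
log_file=/var/log/secure
if [ -r "$log_file" ]; then
    su_pam_events "$log_file" | tail -n 20
fi
```

The module name at the start of each match (pam_unix, pam_sss, pam_wheel and so on) points to the part of the stack that rejected the attempt.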

Architecture Diagram

[Figure: execution diagram for “Troubleshooting 'su' Authentication: The PAM system-auth Pitfall”]

This diagram visualizes the fail-fast condition in the PAM stack. When su goes through system-auth, unexpected modules like pam_fprintd.so can trigger an immediate denial before pam_unix.so or SSSD is ever evaluated. Explicitly routing su requests through password-auth keeps su aligned with sshd and makes Active Directory validation dependable.

Post-Specific Engineering Lens

The primary objective of this post is to harden service integration points and reduce operational surprises.

Implementation decisions for this case

Chose a staged approach centered on Linux to avoid high-blast-radius rollouts.
Used PAM checkpoints to make regressions observable before full rollout.
Treated security documentation as part of delivery, not as a post-task artifact.

Practical command path

These are representative execution checkpoints relevant to this post:

systemctl status <service>
ss -tulpn
journalctl -u <service> -n 200 --no-pager

Validation Matrix

Validation goal	What to baseline	What confirms success
Functional stability	service availability, package state, SELinux/firewall posture	`systemctl --failed` stays empty
Operational safety	rollback ownership + change window	`journalctl -p err -b` has no new regressions
Production readiness	monitoring visibility and handoff notes	critical endpoint checks pass from at least two network zones

Failure Modes and Mitigations

Failure mode	Why it appears in this type of work	Mitigation used in this post pattern
Auth or trust mismatch	Service looks up but rejects real traffic	Validate identity chain and clock/DNS assumptions
Policy-control conflict	SELinux/firewall blocks valid paths	Capture allow-list requirements before rollout
Partial restart strategy	Config is applied but not activated safely	Use staged restart with health gates

Recruiter-Readable Impact Summary

Scope: deliver Linux platform changes with controlled blast radius.
Execution quality: guarded by staged checks and explicit rollback triggers.
Outcome signal: repeatable implementation that can be handed over without hidden steps.
