Hi Everyone,
TL;DR: Has anyone dealt with debugging why a SOAR remote agent would start failing? How did you manage to get the Agent to connect again?
---
I realized today (after several playbooks started failing) that the remote agent that we set up hasn't connected since the 11th of May. I currently have a case open with support but getting a fix may take a while.
When opening up the Remote Agent page, it shows that the Status is "FAILED" and when I download the logs for the last week, the file itself is empty (I'm assuming this is due to the FAILED connection for over a week).
When I dive into the host, the docker connection is still live and healthy, so I'm not sure where to begin since it's likely on the Google side. (Initial deployment was done through the "Send Installer/docker Command" action within the SOAR itself.
I've done some debugging on my end, including:
- Reviewing Docker logs on the host - nothing new in the logs over the past 3 weeks which could be related to the current FAILED status.
- Disabling / re-enabling the agent but the status still shows as failed and I am unable to find the logs which could point to the reason.
- Deployed a new agent which auto-fails.
Questions:
- Is anyone having the same issue with remote agents?
- How are you monitoring for agents changing status to Failed?
- Have you been able to resolve an issue like this? How did you debug and fix the issue?
TIA