This playbook describes steps to resolve issues when scheduled SCIM provisioning fails.
Overview
SCIM syncs are critical for provisioning users to CMA, ensuring seamless onboarding and consistent access to resources. The sync frequency depends on the Identity Provider (IdP) used, as described in SCIM User Provisioning.
When SCIM provisioning fails, newly created users may be unable to connect or access required services, and security policies may not be enforced correctly. To help ensure rapid detection and resolution, an XOps story is automatically generated whenever a provisioning failure occurs between the IdP and CMA.
When responding to Network XOps stories, it is essential to approach the problem in a systematic manner. First, verify that the issue is ongoing, then troubleshoot it, and finally confirm that the problem is resolved.
Step 1 - Verifying the SCIM Sync Failed
The following are the different ways that a Cato Management Application admin can verify that a SCIM Sync has failed.
- An XOps story will be generated when the SCIM Sync fails.
- Go to the Stories Workbench page and use the Network Operations preset, including the filter 'Indication Contains SCIM'. Adjust the time frame as necessary.
- Verify if a story is generated as shown below.
- Click on the story to drill down into the details. It provides information on the story status, an incident timeline, and, more importantly, the status of the SCIM sync.
- As you scroll further down in the story drill-down, you'll find the Incident Timeline. This timeline highlights any changes in the status of the SCIM syncs. On the right pane, you’ll see the playbook workflow that outlines the steps for troubleshooting the issue.
Using the Event
- SCIM sync failures can also be verified by examining the relevant event entries.
- To view this event, filter the Event Dashboard by setting Sub-Type to SCIM Provisioning and Action to Failed. Adjust the timeframe as needed to match when the issue occurred.
- If a SCIM sync failure is detected, you will see events similar to the example shown below. The event message will show the reason for the sync failure. In the example shown below, this is due to an "Internal Server Error".
This section outlines the tools available in Cato for a structured troubleshooting approach to incidents of this type. While the steps are generally meant to be followed in order, the results of each check may influence the next step in the process.
- To determine whether the issue was a one-time occurrence, perform a provision on demand from the IdP platform. For Azure, navigate to Enterprise Applications > Cato Networks Provisioning > Provisioning and click "Provision on demand."
- Select a user or group assigned to the Application and click the "Provision" button.
- If the SCIM sync completes successfully, it may indicate that the SCIM Sync Failure was an isolated incident. The administrator should verify whether any provider interruptions or Cato maintenance activities coincided with the SCIM sync time.
- If the SCIM sync also fails, it indicates that the issue persists and the sync with the IdP is not completing successfully. In this case, review any recent configuration changes that may have led to the problem.
Reviewing Changes in Audit Trail
- Review the changes on the Audit Trail page to determine if a configuration change is the cause of this issue. This step is especially important if the scheduled sync had been functioning normally but stopped working unexpectedly.
- To view any changes made to the domain configuration, filter the Audit Dashboard by setting Model Type to Domain. Adjust the timeframe as needed to match when the issue occurred.
- For example, the screenshot below shows that the admin made configuration changes to the Okta SCIM provisioning. If the timing of this activity aligns with the SCIM Sync Failure, the admin can revert the changes to determine if the changes are the cause.
- Another factor that could affect SCIM connectivity is changes to the Application on the IdP side. For Azure, navigate to Enterprise Applications > Cato Networks Provisioning and check the Audit logs and Provisioning logs to determine whether a configuration change triggered the SCIM failure.
Update Admin Credentials
To verify a credential failure with the IdP, generate a new token from CMA under Directory Services > SCIM. Click "generate token" and copy the new token.
For Azure, navigate to Enterprise Applications > Cato Networks Provisioning > Provisioning and expand Provisioning > Admin Credentials. Enter the token generated from CMA and click "Test Connection".
-
If the authentication succeeds, the issue may be related to a token mismatch between the IdP and CMA.
After identifying and resolving the issue that caused the SCIM sync to fail, verify that the sync is now showing as resolved in the Story.
NOTE: Once the issue is resolved, the status of the story will change from "Open" to "Monitoring." It will remain in this state for the next hour, provided there are no further incidents. For more information, refer to Understanding the Stories Columns.
Raising Cases to Cato Support
If following this playbook has not resolved an issue, submit a Support ticket. To get the most helpful response to a request, an administrator should provide the results of the troubleshooting steps taken.
0 comments
Please sign in to leave a comment.