Incident: Cannot Submit Support Ticket Request
- Date of incident: Sat March 8, 2025
- Recovery time: 3 hours
- Impacted product(s): Support Site
- Primary investigator(s): Anand Patil, Sanjay Akula, Roberto Crisial
- Incident repair team: Sanjay Akula, Roberto Crisial, Steve Hurlock
Timeline
- All times EST on Saturday 03/08/25.
- The outage lasted 8 hours, from 3:30 am to 11:30 am.
- We became aware of it at 9 am, making our recovery time 2.5 hours.
5:21pm – Kshitij Tilekar reports in the Incident channel: Unable to submit a Support Ticket Request Incident Response I am unable to submit a Support Ticket Request. The Category and Type of Issue drop-downs show no options in them.
5:36pm – There is a discussion about what info comes from Salesforce (since we saw recent issues with Salesforce connections on support2), but Rob and Sanjay see that there are no Categories being populated.
6:26pm – Jared confirms that Categories do not come from Salesforce and contacts Anand.
7:12pm – Anand finds that the support site is not able to connect to MongoDB.
7:55pm – Jared posts an alert about the issue but can't update the support banner.
~8:15pm – Sanjay and Anand restart the support app, but see that it can't connect to MongoDB. Sanjay reboots the support server, but still could not see the mongoDB connection. Sanjay adds an EIP to the EC2 to avoid the IP changing on reboot. Steve adds the new IP address to MongoDB allow list, resolving the issue.
8:22pm – Kashish confirms the fix
8:31pm – Anand posts the following summary:
Investigation Summary
- Anand and Sanjay identified the issue: the database was attempting to connect but encountering a timeout error.
- We engaged Rob for further investigation, and he suggested creating an Elastic IP address to allow access in MongoDB.
- Sanjay created the Elastic IP, and Steve added it to MongoDB Atlas.
- Rob confirmed successful communication.
- Sanjay restarted the support application.
- The database is now connecting successfully.
- The support application is loading categories as expected.
Root Cause Analysis
- No tickets could be submitted on the Support.wwnorton.com website.
- Support EC2 did not have Mongodb connection: why? no restarts happened, timeout in log at what time?
- a. instance was not reaching mongo cluster using interface in mongo allow list, could be AWS maintenance b. Mongo db allow list or other action affected this connection.
Learned
What didn’t we know before this incident that we now know?
- Should we move the support app to EKS instead of running on EC2?
Action Items
- Get logs show Mongodb disconnect time and error message from docker container. Other logs in Graylog and on EC2? Anand & Sanjay
- Identify original IP address for the support site - was this affected by IP address cleanup, think not, but need to confirm? Check in Cloudtrail. - Rob
- Check with MongoDB for actions taken during this time. - Steve
- Discuss pros and cons of moving to EKS - separate meeting
- What actions should be taken if this happens again - once root cause identified.
- Discuss monitoring for the site, mongodb & salesforce connections, need alerts. -- next meeting
- Why are logs not being collected from support site? - check in credential files, need to move to DNS instead of IP? Anand & Sanjay



