ITS Services Status
Current Service Interruptions
- RESOLVED: UT-Virtual ESXi Server Crash
Networking and Systems have concluded their review of the UT-V network service outage on Thursday, January 30th, 2014.
Details of the outage, including the timeline, impact, root cause analysis, and lessons learned can be found at: https://wikis.utexas.edu/x/jA95Aw or https://wikis.utexas.edu/display/networking/UT-V+Networking+Failure+Analysis+-+2014-01-30
We apologize for the service interruption and are taking steps to review the system to avoid or reduce the impact of similar problems in the future.
ITS Networking is preparing a Root Cause Analysis report.
Steps have been taken to mitigate the issue until the root cause can be addressed. VM owners have been notified of the second crash via the technical contact email addresses on record for the affected VMs.
At this time, there is no indication of service outages associated with this UT-V Host failure.
Affected VMs have been identified. Owners of affected VMs are being notified.
Another UT-V ESXi host crashed at approximately 9:28pm, likely due to the same issue that was diagnosed this afternoon. VMs hosted by the server have been automatically restarted on other hosts. Affected VM owners will be contacted, but they should check their services for potential impact.
The root cause of the server crash has been diagnosed.
Affected VM owners are being identified and contacted.
The server which crashed was in the Commodity2 cluster.
57 VMs were affected.
At noon today (Wed. Jan 30) one of our UT-Virtual ESXi servers crashed. Virtual Machines on the affected server went down with the server, and the High Availability component restarted them on other servers in the cluster. Other ESXi servers are unaffected. We are compiling a list of affected VMs and opening a ticket with VMware to determine the cause of the crash.
- RESOLVED: Emergency Wireless Network Maintenance
Maintenance is complete.
The maintenance is beginning.
A new maintenance window has been published for Campus Network.
Start: Thursday, March 13, 2014 7:00 PM
End: Thursday, March 13, 2014 8:00 PM
Emergency Wireless Network Maintenance
Networking will be conducting emergency, disruptive maintenance on the DHCP servers supporting the campus wireless network. The maintenance is to restore the failover mode of operation on the servers. (Resiliency was previously shut off following the outage on 2/13.)
Expected impact: Devices will be unable to obtain IP addresses on the wireless network intermittently throughout the maintenance window (although they will still be able to associate to the wireless network). This affects all “restricted.utexas.edu” and “wifi-help.utexas.edu” wireless networks on campus.
Worst-case impact: Extended loss of DHCP services for wireless networks.
No maintenance to be scheduled