The University of Texas at Austin

ITS Services Status

Current Service Interruptions

7/31/2014
11:59 AM
Emergency Maintenance - UDC-B - UPS Maintenance
Unplanned Maintenance

11:59
Greetings All,
          This message is to inform you that at 12:00 we will be putting the UPS at COM 10 into bypass mode to enter another level of trouble shooting on an error that came in this morning for this UPS. This in and of itself will not cause an outage or a problem, but will leave equipment exposed if there is a power event here on campus during the time we have the UPS in bypass mode. There is a potential for an outage however the probability is low. We will send out notification when we are out of this mode.

If you have any questions or concerns with this maintenance, please notify us as soon as possible at udc-fac@its.utexas.edu or contact the UDC Command Center at 512.471.0007.

7/30/2014
6:07 PM
RESOLVED: FC5 Network outage reported
Service Restored

All service has been restored. Future outages to repair the cause for the power loss will be planned.


A temporary power solution is in place, and the switches are all back online. Network access has been restored to FC5, including VoIP.


As of approximately 3:00 PM, the ITS Help Desk has received reports that all VoIP telephony in FC5 is offline. Departmental technical support contacts have been alerted.

7/30/2014
5:11 PM
RESOLVED: Emergency Maintenance: UT-V vCenter Restart
Service Restored

Maintenance completed successfully. vCenter was unavailable less than 5 minutes.


In order to implement a VMware recommended configuration change, UT-V’s production vCenter services will be restarted today at 5:00pm. There will be no service impact to running VMs. However, VM administrators will not be able to manage VMs via vCenter. Services should restored by 5:30pm.

7/29/2014
4:59 PM
On-Going UT-V Storage Issue
Under Investigation

4:58
The UT-V environment has been stable so far today. UT-V staff continue to monitor the issue and prepare for a rollback to ESXi 5.1 if the driver change does not resolve the issue.
Next scheduled update 8/1/2014 at 16:00.


17:00
The final host’s storage controller driver has been changed.
The UT-V environment has been stable so far today. UT-V staff continue to monitor the issue and prepare for a rollback to ESXi 5.1 if the driver change does not resolve the issue.
Next update 7/29/2014 at 17:00


10:01
The UT-V hosts are in the same state as last night. All UT-V hosts have had their storage controller driver changed except for one host in Commodity 2. The final host has only one VM and we will be working with the owner this morning to gracefully migrate the VM so that we can change the driver on the final host.
The Enterprise cluster (with only ITS VMs) did have a period of high storage latency early this morning and a host connection issue that lasted under 1 minute, however, we believe these are different issues at this point and have a case opened with the vendor.
The UT-V team continues to monitor the situation and a new host issue like we have been seeing will trigger a rollback to ESXi 5.1.


7/27/14 11:36 pm
The UT-V team has completed applying storage controller driver changes for all UT-V hosts with one exception in the Commodity 2 cluster. The final host has only one VM running on it that cannot be migrated. It will be addressed tomorrow after reaching out to the VM’s owner.

Next update by 10:00am.


7/27/14 8:38 pm
Vent17 is restored and affected services have been confirmed functional by service owners.

The UT-V team will be doing rolling reboots (transparent) to apply storage controller driver changes in the Commodity 2 and Enterprise clusters to further stabilize the environment. We do not expect any downtime for VMs.


7/27/14 7:51 pm
Vent17 is being restarted. We expect VMs to be back up by 8:15 at the latest.  Service owners should check their VMs for functionality.

Affected services include:
ApplyTx, Austin Disk, Java Mail, MyBenefits, PYPE, UTWeb, Texas Enterprise Directory


7/27/14 7:24 pm
Affected services for Vent12 include:

Apply Texas, DocRepo (Test), JavaMail, UTWeb, MSSQL, Austin Disk


7/27/14 7:16 pm
Vent12 is scheduled to be restarted at 7:15. We expect VMs to be back up by 7:45 at the latest.  Service owners should check their VMs for functionality.

Vent17 is currently scheduled to be restarted at 7:45.


7/27/14 6:15 pm
We are beginning the rollout of the driver change to the other UT-V clusters. The hosts experiencing issues will be restarted after ITS VM administrators have had a chance to attempt graceful VM shutdowns.


7/27/14 5:55 pm
The hosts have now stopped responding in vCenter. It does appear VMs beyond Austin Disk are affected. This cluster contains only ITS VMs.


Two hosts in the UT-V environment appears to be having storage issues with a particular array. ITS Systems staff are investigating.


7/27/14 5:24 pm
Two hosts in the Enterprise cluster (which has not had the HBA driver change applied) are experiencing storage issues. The known impact is limited to Austin Disk Services at this point but may be wider.


July 27, 2014 2:52pm
The HBA drivers have been changed in the Commodity 1 cluster. It continues to operate stably. UT-V staff are monitoring the environment.


July 25, 2014 8:43pm
UT-V staff continue to work with vendor support to resolve this ongoing issue.  The UT-V team is actively working the following plan:

1. Change the default Host Bus Adapter (HBA) driver for the VMware hypervisor in Commodity 1 cluster and reboot all hosts to register the new driver.
2. Observe cluster functionality to determine if HBA driver change has a positive impact on the current issue.
3a. If no further host drop outs in Commodity 1 cluster, change default HBA’s in remaining UT-V clusters.
3b. If host dropouts continue in Commodity 1 cluster, rollback all UT-V clusters to ESXi 5.1, the last stable install base.

Next update Saturday July 26, 2014, 9pm.


July 25, 2014 6:02pm
The UT-V team continues to roll out a configuration change to attempt to stabilize the most problematic cluster.
Next update at 9:00 PM.


July 25, 2014 3:19pm
All VMs affected this afternoon have been powered back on.
UT-V are working to implement a change we hope will mitigate the issue.
Next Update 6:00 PM


July 25, 2014 2:24pm
The host currently affected by the issue will be rebooted at 3:00pm.


July 25, 2014 2:02pm
Another host is experiencing the storage issue. VM admins will be contacted.


July 25, 2014 11:06am
All affected VM’s have been powered up.


July 25, 2014 7:59am
UT-V staff are investigating a current issue with 3 hosts in UT-V.  Some VMs maybe be non-responsive.  UT-V staff are in the process of compiling lists of affected VMs.


UT-V is experiencing ongoing issues with the DDN storage arrays.  VMware and DDN support groups are engaged.  This alert will remain open for the duration of this ongoing issue. 
Next Update EOD July 25, 2014.

7/25/2014
12:23 PM
University Wiki Service intermittant instability
Service Functioning, Temporary Solution in Place

2:10 pm
The University Wikis Service is experiencing intermittent outages. ITS staff continue to monitor and investigate the root cause of the issue but users should expect slow or unresponsive service. 


1:37pm
ITS Staff are investigating.


1:21pm
As of approximately 1:14 PM, ITS Help Desk staff observed that pages on the University Wiki Service were loading slowly (>45 seconds) or not at all.

Past Service Interruptions

Upcoming Maintenance

<<< 07/27/2014 - 08/16/2014 >>>
Sunday Monday Tuesday Wednesday Thursday Friday Saturday
27

12:00 AM - Austin Disk Services Planned Maintenance Window

12:01 AM - SharePoint 2010 Maintenance

7:00 AM - EATON Annual Switchgear Planned - Non-Disruptive Maintenance

6:00 PM - Domain Controller Operating System Upgrade

10:00 PM - Update to DMG

28

6:00 PM - Domain Controller Operating System Upgrade

29

7:00 AM - Central Web Authentication maintenance in Beta UT Direct

7:15 AM - Virtual ID Card Non-Disruptive Maintenance

7:15 AM - Student Photo Roster Non-Disruptive Maintenance

7:15 AM - ID Photo Gateway Non-Disruptive Maintenance

5:30 PM - UTLogin Expedited - Non-Disruptive Maintenance

6:00 PM - Domain Controller Operating System Upgrade

10:00 PM - SharePoint Maintenance (Tue)

30

8:00 AM - ITS Website Non-Disruptive Maintenance

12:00 PM - Course Instructor Survey non-disruptive maintenance

12:01 PM - CTL's Credit by Exam system maintenance

6:00 PM - Domain Controller Operating System Upgrade

10:00 PM - UT Lists planned maintenance

31

6:30 AM - TRAC maintenance

6:30 AM - Two Factor Authentication System Routine Software Updates

5:00 PM - Blog Service Weekly Maintenance

6:00 PM - Domain Controller Operating System Upgrade

1

12:01 PM - CTL's Credit by Exam System Maintenance

6:00 PM - Domain Controller Operating System Upgrade

10:00 PM - SharePoint Maintenance (Fri)

2

7:00 AM - EWDE Maintenance

7:00 AM - Weekly planned maintenance for OS patching

8:00 AM - Footprints Maintenance - ITS Incident Management

6:00 PM - Domain Controller Operating System Upgrade

3

12:00 AM - Blackboard Weekly Maintenance

12:00 AM - Austin Disk Services Planned Maintenance Window

12:00 AM - Mainframe maintenance window

12:01 AM - SharePoint 2010 Maintenance

6:00 PM - Domain Controller Operating System Upgrade

4

3:30 PM - PyPE Non-Prod OS Maintenance

6:00 PM - Domain Controller Operating System Upgrade

5

7:00 AM - Monthly JIRA Application Maintenance

7:00 AM - NOC A Generator service

7:00 AM - Central Web Authentication Maintenance on DPDev1

8:00 AM - beta.dp.utexas.edu Utlogin maintenance

6:00 PM - Domain Controller Operating System Upgrade

10:00 PM - SharePoint Maintenance (Tue)

10:00 PM - Web Central Biweekly Scheduled Maintenance

6

6:00 AM - Monthly MSSQL Maintenance

8:00 AM - ITS Website Non-Disruptive Maintenance

6:00 PM - Domain Controller Operating System Upgrade

9:30 PM - Document Repository - quarterly PROD host patching

7

5:00 PM - Blog Service Weekly Maintenance

6:00 PM - Domain Controller Operating System Upgrade

8

6:00 PM - Domain Controller Operating System Upgrade

10:00 PM - SharePoint Maintenance (Fri)

9

7:00 AM - EWDE Maintenance

7:00 AM - Weekly planned maintenance for OS patching

8:00 AM - Web Central Scheduled Maintenance

6:00 PM - Domain Controller Operating System Upgrade

10

12:00 AM - Blackboard Weekly Maintenance

12:00 AM - Austin Disk Services Planned Maintenance Window

12:01 AM - SharePoint 2010 Maintenance

6:00 PM - Domain Controller Operating System Upgrade

11

6:00 PM - Domain Controller Operating System Upgrade

12

8:00 AM - UTForge quarterly OS maintenance

6:00 PM - Domain Controller Operating System Upgrade

10:00 PM - SharePoint Maintenance (Tue)

13

8:00 AM - ITS Website Non-Disruptive Maintenance

6:00 PM - Domain Controller Operating System Upgrade

10:00 PM - UT Lists planned maintenance

14

6:30 AM - UTLogin Planned Maintenance

5:00 PM - Blog Service Weekly Maintenance

6:00 PM - Domain Controller Operating System Upgrade

15

6:00 PM - Domain Controller Operating System Upgrade

10:00 PM - SharePoint Maintenance (Fri)

16

7:00 AM - EWDE Maintenance

7:00 AM - Weekly planned maintenance for OS patching


  No maintenance to be scheduled
  Today's date

We Can Help

Get help from an expert:

* ITS Help and Service Desk

* Call us at 512-475-9400

* Submit a help request online

We also have a walk-in service in the first floor lobby of the Flawn Academic Center (FAC). Stop by and let us help you!