The University of Texas at Austin

ITS Services Status

Current Service Interruptions

7/28/2014
5:07 PM
On-Going UT-V Storage Issue
Under Investigation

17:00
The final host’s storage controller driver has been changed.
The UT-V environment has been stable so far today. UT-V staff continue to monitor the issue and prepare for a rollback to ESXi 5.1 if the driver change does not resolve the issue.
Next update 7/29/2014 at 17:00


10:01
The UT-V hosts are in the same state as last night. All UT-V hosts have had their storage controller driver changed except for one host in Commodity 2. The final host has only one VM and we will be working with the owner this morning to gracefully migrate the VM so that we can change the driver on the final host.
The Enterprise cluster (with only ITS VMs) did have a period of high storage latency early this morning and a host connection issue that lasted under 1 minute, however, we believe these are different issues at this point and have a case opened with the vendor.
The UT-V team continues to monitor the situation and a new host issue like we have been seeing will trigger a rollback to ESXi 5.1.


7/27/14 11:36 pm
The UT-V team has completed applying storage controller driver changes for all UT-V hosts with one exception in the Commodity 2 cluster. The final host has only one VM running on it that cannot be migrated. It will be addressed tomorrow after reaching out to the VM’s owner.

Next update by 10:00am.


7/27/14 8:38 pm
Vent17 is restored and affected services have been confirmed functional by service owners.

The UT-V team will be doing rolling reboots (transparent) to apply storage controller driver changes in the Commodity 2 and Enterprise clusters to further stabilize the environment. We do not expect any downtime for VMs.


7/27/14 7:51 pm
Vent17 is being restarted. We expect VMs to be back up by 8:15 at the latest.  Service owners should check their VMs for functionality.

Affected services include:
ApplyTx, Austin Disk, Java Mail, MyBenefits, PYPE, UTWeb, Texas Enterprise Directory


7/27/14 7:24 pm
Affected services for Vent12 include:

Apply Texas, DocRepo (Test), JavaMail, UTWeb, MSSQL, Austin Disk


7/27/14 7:16 pm
Vent12 is scheduled to be restarted at 7:15. We expect VMs to be back up by 7:45 at the latest.  Service owners should check their VMs for functionality.

Vent17 is currently scheduled to be restarted at 7:45.


7/27/14 6:15 pm
We are beginning the rollout of the driver change to the other UT-V clusters. The hosts experiencing issues will be restarted after ITS VM administrators have had a chance to attempt graceful VM shutdowns.


7/27/14 5:55 pm
The hosts have now stopped responding in vCenter. It does appear VMs beyond Austin Disk are affected. This cluster contains only ITS VMs.


Two hosts in the UT-V environment appears to be having storage issues with a particular array. ITS Systems staff are investigating.


7/27/14 5:24 pm
Two hosts in the Enterprise cluster (which has not had the HBA driver change applied) are experiencing storage issues. The known impact is limited to Austin Disk Services at this point but may be wider.


July 27, 2014 2:52pm
The HBA drivers have been changed in the Commodity 1 cluster. It continues to operate stably. UT-V staff are monitoring the environment.


July 25, 2014 8:43pm
UT-V staff continue to work with vendor support to resolve this ongoing issue.  The UT-V team is actively working the following plan:

1. Change the default Host Bus Adapter (HBA) driver for the VMware hypervisor in Commodity 1 cluster and reboot all hosts to register the new driver.
2. Observe cluster functionality to determine if HBA driver change has a positive impact on the current issue.
3a. If no further host drop outs in Commodity 1 cluster, change default HBA’s in remaining UT-V clusters.
3b. If host dropouts continue in Commodity 1 cluster, rollback all UT-V clusters to ESXi 5.1, the last stable install base.

Next update Saturday July 26, 2014, 9pm.


July 25, 2014 6:02pm
The UT-V team continues to roll out a configuration change to attempt to stabilize the most problematic cluster.
Next update at 9:00 PM.


July 25, 2014 3:19pm
All VMs affected this afternoon have been powered back on.
UT-V are working to implement a change we hope will mitigate the issue.
Next Update 6:00 PM


July 25, 2014 2:24pm
The host currently affected by the issue will be rebooted at 3:00pm.


July 25, 2014 2:02pm
Another host is experiencing the storage issue. VM admins will be contacted.


July 25, 2014 11:06am
All affected VM’s have been powered up.


July 25, 2014 7:59am
UT-V staff are investigating a current issue with 3 hosts in UT-V.  Some VMs maybe be non-responsive.  UT-V staff are in the process of compiling lists of affected VMs.


UT-V is experiencing ongoing issues with the DDN storage arrays.  VMware and DDN support groups are engaged.  This alert will remain open for the duration of this ongoing issue. 
Next Update EOD July 25, 2014.

7/28/2014
11:26 AM
RESOLVED:Austin Disk - WebDAV performance
Service Restored

1119
The aforementioned servers have been rebooted and are functioning normally.  ITS systems will continue to monitor WebDAV performance metrics.


10:15
To resolve WebDAV performance issues, a rolling restart of Austin Disk WebDAV frontend servers will be initiated at 11:00 AM.  Customer impact should be minimal due to the redundant nature of the frontends.

7/25/2014
12:23 PM
University Wiki Service intermittant instability
Service Functioning, Temporary Solution in Place

2:10 pm
The University Wikis Service is experiencing intermittent outages. ITS staff continue to monitor and investigate the root cause of the issue but users should expect slow or unresponsive service. 


1:37pm
ITS Staff are investigating.


1:21pm
As of approximately 1:14 PM, ITS Help Desk staff observed that pages on the University Wiki Service were loading slowly (>45 seconds) or not at all.

Past Service Interruptions

Upcoming Maintenance

<<< 12/16/2012 - 01/05/2013 >>>
12:00 AM - 6:00 AM
Blackboard Weekly Maintenance
Service: Blackboard
Description: Weekly maintenance event for Blackboard (courses.utexas.edu).

The service may be degraded or down during this time.
12:01 AM - 6:00 AM
Austin Disk Services Planned Maintenance Window
Service: Austin Disk Services
Description: Customers utilizing the following dependent services may be affected:

* Austin Disk shares (user home directories and departmental shares)
* Stat Apps Server (user profiles, redirected documents ... (more)
12:01 AM - 6:00 AM
SharePoint Maintenance Window
Service: SharePoint
Description: Service availability may be intermittent during the maintenance window.
10:00 PM - 12:00 AM
Monthly Group E-Mail Maintenance
Service: Group E-Mail
Description: Monthly maintenance window for the Group E-mail server.
7:00 AM - 7:30 AM
Planned Maintenance for Central Web Authentication in Qual and Beta
Service: Central Web Authentication (CWA)
Description: Central Web Authentication will undergo routine scheduled maintenance in the Qual and Beta environments. Users should experience no interruption of service.
7:00 AM - 8:00 AM
Pype Network Maintenance Dry-run
Service: Python Production Environment (PyPE)
Description: The Pype service in TEST and the deployment interface will be unavailable from approximately 7am up to 8am as the Pype team tests their shutdown procedure for the January 6th network maintenance. ... (more)
8:00 AM - 9:00 AM
Quarterly JIRA OS Maintenance
Service: Jira
Description: JIRA will undergo regular operating system (OS) maintenance on the third Tuesday of every third month.
8:00 AM - 8:30 AM
Student Photo Roster Planned Maintenance
Service: Student Photo Roster
Description: On Tuesday, December 18th from 8:00 - 8:30am the Student Photo Roster will undergo routine scheduled maintenance. Users should experience no interruption in service.
8:00 AM - 8:30 AM
Virtual ID Cards Planned Maintenance
Service: Virtual ID Cards
Description: On Tuesday, December 18th from 8:00 - 8:30am the Virtual ID Cards will undergo routine scheduled maintenance. Users should experience no interruption in service.
8:00 AM - 8:30 AM
ID Photo Gateway Planned Maintenance
Service: ID Photo Gateway
Description: On Tuesday, December 18th from 8:00 - 8:30am the ID Photo Gateway will undergo routine scheduled maintenance. Users should experience no interruption in service.
6:00 PM - 10:00 PM
Wiki Service Monthly Maintenance
Description: Scheduled maintenance for the University Wiki Service occurs the third Tuesday of the month from 6 p.m. - 10 p.m. To the maximum extent possible, installation of service, application and security ... (more)
9:00 AM - 5:00 PM
Update vSphere 5 to latest revision
Service: Virtual Servers
Description: Update the vSphere ESXi 5 to the latest ESXi patch revisions.

No service interruptions are anticipated.
1:30 PM - 2:30 PM
UTmail Signup Portal maintenance
Service: UTmail
Description: ITS will be performing maintenance on the UTmail signup portal. No interruption in service is expected.
6:00 PM - 11:59 PM
Blog Service Weekly Maintenance
Service: University Blog Service
Description: Scheduled maintenance may occur weekly on Wednesdays between 6 p.m. and midnight. To the maximum extent possible, installation of service, application, and security updates will be performed during ... (more)
9:00 PM - 10:00 PM
Patching of Oracle Central RAC (OAASPROD)
Service: Oracle
Description: Install an Oracle database patch on OAASPROD in a rolling fashion. No service interruption is expected.
8:00 AM - 9:00 AM
UTForge Maintenance
Service: UTForge
Description: Security patches will be applied to the OS during this maintenance event.
Subversion will be available throughout the maintenance window.
Trac will be unavailable for a portion of the event.
9:00 AM - 5:00 PM
Update vSphere 5 to latest revision
Service: Virtual Servers
Description: Update the vSphere ESXi 5 to the latest ESXi patch revisions.

No service interruptions are anticipated.
10:30 PM - 11:45 PM
Emergency Maintenance - JIRA - Cat I MySQL
Service: Jira
Description: All instances of JIRA will be unavailable tonight 12/20/2012 from 10:30 PM to 11:45 PM due to the emergency maintenance event for Category I MySQL database.
9:00 AM - 5:00 PM
Update vSphere 5 to latest revision
Service: Virtual Servers
Description: Update the vSphere ESXi 5 to the latest ESXi patch revisions.

No service interruptions are anticipated.
12:01 AM - 6:00 AM
SharePoint Maintenance Window
Service: SharePoint
Description: Service availability may be intermittent during the maintenance window.
4:00 PM - 5:00 PM
ERP Web Infrastructure Maintenance Window
Service: ERP UI Web Infrastructure
Description: ERP UI, ERP Application Registry, ERP UI Site generator application upgrades occur during this time. Minimal impact on users should be expected.
12:01 AM - 6:00 AM
Austin Disk Services Planned Maintenance Window
Service: Austin Disk Services
Description: Customers utilizing the following dependent services may be affected:

* Austin Disk shares (user home directories and departmental shares)
* Stat Apps Server (user profiles, redirected documents ... (more)
12:01 AM - 6:00 AM
SharePoint Maintenance Window
Service: SharePoint
Description: Service availability may be intermittent during the maintenance window.
9:00 AM - 12:00 PM
uTexas Enterprise Directory (TED) routine maintenance.
Service: uTexas Enterprise Directory (TED)
Description: Routine maintenance on TED (The uTexas Enterprise Directory) will be done at this time. No interruption of service is anticipated.
6:00 PM - 11:59 PM
Blog Service Weekly Maintenance
Service: University Blog Service
Description: Scheduled maintenance may occur weekly on Wednesdays between 6 p.m. and midnight. To the maximum extent possible, installation of service, application, and security updates will be performed during ... (more)
6:00 AM - 8:00 AM
Network Maintenance for NFS file servers
Description: We will be performing network maintenance on the StoCon file servers in order to remedy existing issues with these file servers' network configuration. List of affected services includes:

* ... (more)
6:00 AM - 8:00 AM
UTForge Maintenance
Service: UTForge
Description: UTForge will be unavailable due to network maintenance.
6:00 AM - 7:00 AM
uTexas Identity Manager (TIM) PROD and TEST Maintenance
Service: uTexas Identity Manager (TIM)
Description: In order to complete today's NFS file server maintenance, we will be conducting maintenance on TIM PROD and TEST. An approximate 10 minute outage for each service is expected.
10:00 AM - 5:00 PM
7:00 AM - 10:00 AM
SecureDoc service and server maintenance
Service: Enterprise Whole Disk Encryption
Description: Recurring maintenance window for SecureDoc services.
8:00 AM - 12:00 PM
Web Central Maintenance
Service: Web Central
Description: The Web servers hosting www.utexas.edu and Unix-based custom Web hosting sites will undergo routine maintenance. No customer visible impact is anticipated.

We Can Help

Get help from an expert:

* ITS Help and Service Desk

* Call us at 512-475-9400

* Submit a help request online

We also have a walk-in service in the first floor lobby of the Flawn Academic Center (FAC). Stop by and let us help you!