Senior System Administrator

Seattle, WA

The Institute for Health Metrics and Evaluation (IHME) is an independent research center at the University of Washington. Its mission is to monitor global health conditions and health systems, as well as to evaluate interventions, initiatives, and reforms. IHME carries out a range of projects within different research areas including the Global Burden of Diseases, Injuries, and Risk Factors; Future Health Scenarios; Costs and Cost Effectiveness; Local Burden of Disease; Resource Tracking; and Impact Evaluations. The aim is to provide policymakers, donors, and researchers with the highest-quality quantitative evidence base to make decisions that achieve better health. 

IHME has an outstanding opportunity for a Senior System Administrator. The Senior System Administrator will be responsible for assisting with the day-to-day administration of all internet-facing systems, storage management as well as computational hardware used to support of the Institute’s mission. Additionally, this position will help develop and support forecasting infrastructure. 

Technical Responsibilities Include:

  • Engineering and administration of all Linux/other server systems, including Linux cluster system (RedHat Enterprise 6.x, Ubuntu LTS) 
  • Assist with the development of the new health data forecasting systems 
  • Management of Internet-facing systems, including Apache, NGINX, IIS, Server Load Balancers, and other related systems 
  • Manage VMware ESX/ESXi and Vcenter product family 
  • Assist with architecture, implementation and managing storage systems, including SAN and NAS devices 
  • Implementation and on-going administration of monitoring and alerting systems, create periodic uptime and problem reports 
  • Capacity management and planning for Development / Test / Production systems 
  • Developing and deploying services via container technologies (Docker, Rancher, Mesos, Marathon, Aurora) 
  • Manage co-location, racking systems, cabling 
  • Manage domain name registration and DNS systems 
  • Manage Backups and Disaster Recovery for Systems 
  • Participate in 24X7 on-call rotation response to production system issues with other IHME-IT team members 
  • Create a library of system documentation, provide training for team members and interns 
  • Working with the IHME-IT team, determine or perform the following:  - Data Access requirements for internal and external customers  - Requirements for uptime, performance, and speed-of-delivery for content and datasets  - Data storage requirements, including projected growth and backup configuration  - Management and support of the IHME Global Health Data Exchange  - Create and Implement business continuity practices  - Assist with the planning and cost-effective purchasing of hardware, software and services 
  • Cost-effective and reliable vendor selection for hardware, software and services 

Requirements: 

  • Bachelor's degree in Computer Science, Management Information Systems or a related field or concentration, or a Bachelor’s degree in and significant experience with social sciences, or equivalent experience 
  • Minimum 2 years’ experience with Unix / Linux 
  • Minimum 2 years’ experience with VMware Server environment. 

Additional Requirements 

  • Outstanding interpersonal skills, including team ethic and relationship building 
  • Excellent written and verbal communication skills, exceptional decision-making skills 
  • Self-starter and demonstrated ability to learn/adapt new methods to support this role 
  • Experience in security system design and engineering 
  • Experience working with third-party vendors for hardware and software procurement 
  • Ability to work under and deliver quality systems constant deadline pressures 
  • Periodic support with Microsoft Windows Server and Active Directory administration 
  • Occasional end-user support 

Equivalent education/experience will substitute for all minimum qualifications except when there are legal requirements, such as a license/certification/registration. 

Desired

  • Experience with clustered computer systems 
  • Windows Server 
  • Demonstrated track record of innovative solutions deploying and supporting IT initiatives 
  • Interest in the promotion of global health 
  • Knowledge of cluster management and scheduling systems like SGE/UGE or Torque 
  • Experience with Quantum StorNext distributed file systems a plus 
  • Layer 2 & 3 network experience 
  • Familiarity with supporting Java, Python, PHP, reusable code 
  • Familiarity with supporting MySQL or PostgreSQL databases 
  • A deep understanding of security and systems best practices 

Conditions of Employment:

  • Expected to participate in 24X7 on-call rotation response to production system issues with other IHME-IT team members. 

Get weekly notifications when new jobs are posted