IT Administrator III, HPC System Administrator The University of Tennessee, Knoxville Office of Information Technology, High Performance and Scientific Computing Group
The Office of Information Technology (OIT) at The University of Tennessee, Knoxville is seeking qualified applicants for a System Administrator position that will play a key role in evaluating, deploying, maintaining, securing, and operating the research cyberinfrastructure resources that support the research mission of the University. OIT operates computing resources and provides support to University researchers and their collaborators, giving them the capability to investigate and solve cutting-edge, computationally challenging problems. Under the guidance of the Director, High Performance and Scientific Computing, this individual will have in-depth knowledge of system administration and be part of a team that manages centralized high performance computing and storage resources and a secure research computing environment, and that works with faculty and researchers to make effective use of those resources for research.
Major Duties/Responsibilities
The successful candidate will have an in-depth skill set, perform advanced analysis, and provide advanced problem solving to maintain and administer the research cyberinfrastructure resources that support the research mission of the University. The responsibilities of the System Administrator include, but are not limited to: coordinating the configuration and maintenance of the storage resources, including hardware and software; configuring and maintaining server-class HPC resources, including hardware, OS software, application software installed with the OS package management tool, and the resource management system; supporting the Senior System Administrator with maintaining the computational resources; maintaining an effective security posture, including implementing the required security controls specified in the system security plan; diagnosing and resolving hardware, software, networking, and system issues when they arise; monitoring services and responding to service failures; implementing configuration management processes and procedures; providing technical support and user support as needed; documenting processes and procedures to administer and maintain the storage and computational systems and infrastructure; supporting the research cyberinfrastructure portal and database; ensuring required backups are performed regularly and successfully; and working with other OIT groups, such as Networking Services, Help Desk, and Systems, to support the continued and efficient operation of the storage, computational, and infrastructure resources.
Internal Number: 199907
Our primary mission is to move forward the frontiers of human knowledge and enrich and elevate the citizens of the state of Tennessee, the nation, and the world. As the preeminent research-based, land-grant university in the state, UT embodies the spirit of excellence in teaching, research, scholarship, creative activity, outreach, and engagement attained by the nation’s finest public research institutions.