1. Provides technical expertise in resolving user system deficiencies and determines appropriate action. 2. Provides system services and analyze system performance for stakeholders and intended end users. Performs all activities necessary to activate a new operating system or new release of an existing system, including analysis, design, implementation, and related documentation. Analyzes systems performance and modifies programs to increase the efficiency of the operation. Reinstates integrity of system as quickly as possible following an outage in order to minimize item and data loss. 3. Recommends and authorizes system upgrades and software installations. 4. Designs, develops and implements new system tools. 5. Analyzes execution time of commonly used instruction to identify and replaces those that are inefficient or slow to operation. 6. Analyzes, evaluates and takes steps to circumvent problems and restores systems to operating condition. 7. Contributes in the determination of specifications and determines the combination of options needed to tailor an operating system to meet the business needs. 8. Conducts training and user education. 9. Researches new technologies, processes, and methodologies. 10. May perform other duties as assigned.
Preferred Education: Experience with HPC clusters, preferably with administration thereof. Extensive knowledge of clustering tools, e.g., Slurm, xcat. Experience with technology in a research environment. Expertise with high-speed networking such as InfiniBand and 10/40 Gigabit ethernet. Familiarity with large storage systems and parallel file systems such as GPFS and Lustre.
Preferred Education, Experience and Skills: Experience with HPC clusters, preferably with administration thereof. Extensive knowledge of clustering tools, e.g., Slurm, xcat. Experience with technology in a research environment. Expertise with high-speed networking such as InfiniBand and 10/40 Gigabit ethernet. Familiarity with large storage systems and parallel file systems such as GPFS and Lustre.
Required Skill/ability 5: Attention to detail with the proven ability to take the care necessary to be entrusted with a system that hundreds of users depend on for research computation and the storage of research data.
Posting Position Title: Operating Systems Programmer
Required Skill/ability 3: Proven ability to work in team environment in fast-moving technology field.
Work Week: Standard (M-F equal number of hours per day)
University Job Title: High-Performance Computing System Administrator
Required Skill/ability 1: Proven expertise with Linux operating system distributions.
Required Skill/ability 4: Excellent verbal and writing skills. Ability to interact well with team members and end users. Ability to work independently and across units.
Required Skill/ability 2: Expertise with bash and at least one other scripting language. Demonstrated expertise with Linux system administration, including OS, networking, storage, and security.
Bachelor's Degree in a related field and a minimum of two years of related work experience or an equivalent combination of education and experience.
Internal Number: 58291BR
About Yale University
Yale University is an American private Ivy League research university located in New Haven, Connecticut. Founded in 1701 in the Colony of Connecticut, the university is the third-oldest institution of higher education in the United States.