Administer, install, monitor, and maintain HPC systems, including compute nodes, storage, networking, and software stacks.
Develop and maintain automation tools for system provisioning, configuration management, and monitoring.
Assist in the implementation and management of distributed file systems (e.g., Lustre, BeeGFS, GPFS).
Install, configure, and optimize job scheduling and resource management tools (e.g., Slurm, LSF, PBS).
Assist in system security, patch management, and troubleshooting operational issues.
Contribute to performance benchmarking, system tuning, and capacity planning.
Deploy and maintain commonly used HPC applications and software stacks.
Document system administration procedures and contribute to knowledge-sharing initiatives.
Support researchers by providing technical expertise and resolving escalated support tickets.
Participate in vendor coordination, system procurement, and hardware/software lifecycle management.
Installs, configures, and maintains operating system workstations and servers. Performs software installations and upgrades to operating systems and layered software packages. Monitors and tunes the system to achieve optimum performance levels, acquiring higher-level skills in the process.
Maintains all supporting documentation for comprehensive operating system, hardware and software configuration. Monitors primary responses for information technology related security incidents and violations. Keeps current with new security and network monitoring technologies, applicable laws, and regulations.
Familiarity with high-speed networking (e.g., InfiniBand, Ethernet).
Scripting/programming skills (Python, Bash, or Perl).
Experience configuring, installing and troubleshooting MPI and OpenMP applications.
Experience configuring, installing, tuning and maintaining scientific applications on large-scale systems.
Experience with system automation tools (e.g., Ansible, Puppet).
Experience with system provisioning tools (e.g., xCAT, Confluent, Warewulf, etc).
Knowledge of distributed storage systems (e.g., Lustre, BeeGFS, GPFS).
Experience with containerization (Docker, Singularity, Apptainer).
Experience configuring, installing, maintaining and/or using infrastructure and performance monitoring and optimization tools (such as CheckMK, Grafana, Prometheus, Icinga, etc).
Experience in setting up and executing benchmarks in an HPC environment and analyzing their results systematically.
Preferred Competencies
Ability to work well with faculty and researchers.
Ability to identify and gain expertise in appropriate new technologies and/or software tools.
Ability to understand and translate researchersâ™ scientific goals into technical requirements.
Ability to function as part of an interactive team while demonstrating self-initiative to achieve project's goals and Research Computing Center's mission.
Strong analytical skills, problem-solving ability, attention to detail.
Application Documents
Resume (required)
Cover letter (preferred)
The University of Chicago is an Affirmative Action/Equal Opportunity/Disabled/Veterans Employer and does not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national or ethnic origin, age, status as an individual with a disability, protected veteran status, genetic information, or other protected classes under the law. For additional information please see the University's Notice of Nondiscrimination.
Staff Job seekers in need of a reasonable accommodation to complete the application process should call 773-702-5800 or submit a request via the Applicant Inquiry Form.
The University of Chicago's Annual Security & Fire Safety Report (Report) provides information about University offices and programs that provide safety support, crime and fire statistics, emergency response and communications plans, and other policies and information. The Report can be accessed online at: securityreport.uchicago.edu. Paper copies of the Report are available, upon request, from the University of Chicago Police Department, 850 E. 61st Street, Chicago, IL 60637.
One of the world's premier academic and research institutions, the University of Chicago has driven new ways of thinking since our 1890 founding. Today, UChicago is an intellectual destination that draws inspired scholars to our Hyde Park and international campuses, keeping UChicago at the nexus of ideas that challenge and change the world.