Vacancy expired!
Job Posting Description
Research Computing supports basic, translational, and clinical research by providing researchers with access to digital tools and technologies. Our computing environment consists of high-performance computing clusters, virtual machines, PetaByte-scale high-performance storage, and cloud platforms. We work with BCH scientists to develop solutions in support of cutting-edge research ranging from lattice light-sheet microscopy to CryoEM, genomics, and informatics.The Research Computing DevOps Team supports the HPC team with automation efforts to help improve our processes. This is accomplished with automation tools such as Ansible and programming in Python, but also through looking at our big picture processes and looking for ways to streamline. The team manages 2 HPC clusters used by approximately 500 researchers with a variety of technical backgrounds. These researchers support emerging genomics, structural biology, and radiology research. We have hardware in compute, GPU, and FPGA nodes totalling over 4k compute threads and 20TB of RAM. We also maintain virtual machines and storage clusters to back up our compute infrastructure.The Senior DevOps Infrastructure Engineer shall be responsible for:- Designing, maintaining, and supporting technical infrastructure for our HPC. Working with research projects to support their HPC needs. Working on automation and applying infrastructure as code principles to ease the operational burden and help with customer/researcher pain points. Promoting technological best practices across the research organization and helping colleagues align with those standards.
- Routinely leading, co-leading, or participating in complex DevOps projects; setting goals and objectives for projects and demonstrating achievement of those goals and objectives; coordinating work activities with other stakeholders.
- Training staff and researchers; effectively tailoring presentations; developing, implementing, and maintaining knowledge management systems.
- Creating or contributing to a range of compelling communications that clearly deliver even the most complex content and motivate action; preparing communications appropriate for management and internal distributions; reviewing and providing feedback on documents prepared by staff.
- Presenting at project and departmental meetings; effectively conveying progress and asserting point of view; constructively discussing issues and providing facts; building credibility and trust by asking thoughtful questions and actively listening; running productive project meetings that advance problem-solving.
- 4-year STEM Bachelor's degree or 4-years of STEM experience.
- A minimum of 1 years of position-specific experience (might potentially include experience acquired through MSc or Ph.D. studies).
- Must have experience with infrastructure as code (e.g., Ansible, Puppet, Chef or Salt), Ansible preferred. Must have experience working with version control systems such as Git. Must have experience with Linux administration and with general purpose programming languages such as Python and Bash. Must have experience with HPC environments and workload managers (e.g., Slurm).
- Knowledge of AWS infrastructure and IaC languages such as Terraform or Cloudformation is a plus.
- Ability to resolve a wide range of complex assignments. Ability to routinely lead complex projects and coordinate project teams; readiness to seek advice and guidance when needed and to operate effectively in collaboration with scientists and clinicians; ability to follow, improve and create technical documentation and standard operating procedures.
- Effectively convey messages through written communication that are tailored to target audience.
- Passion for new technologies and science.
- ID: #49277809
- State: Massachusetts Boston 02108 Boston USA
- City: Boston
- Salary: USD TBD TBD
- Job type: Permanent
- Showed: 2023-02-18
- Deadline: 2023-04-18
- Category: Et cetera