HPC Solution Architect (High performance computing)

18 Feb 2025

Vacancy expired!

Role : UNIX ,Windows & VMWare AdminLocation : Austin / TX 100% Onsite/Hybrid Job Description :Project Details Responsible for architecting and implementing Linux High Performance Computing (HPC) clusters. Performs system architecture duties on a Linux High performance computing (HPC) cluster including cluster management, virtualization, cluster usage monitoring, health monitoring, job scheduling, application integration/installation (open source as well as vendor supported), and application performance. Improve cluster performance through kernel changes, firmware updates, library stack changes, and application container management such as docker. Overall Experience 10-12 yrs Domain Experience If applicable Semi Conductor Mandatory Skills and Technologies, framework, and Methodologies

  • HPC Engineering solutions architect with cloud knowledge.
  • Knowledge of Linux and UNIX operating systems, including scripting and programming proficiencies.
  • Experience with cloud bursting technologies.
  • Knowledge of cloud services like AWS SCOCA, Parallel Cluster, and Azure CycleCloud
  • Knowledge of HPC tools and storage: AWS Elastic Fabric Adapter, Azure ANF, Apache Spark, or Apache Ignite, Lustre, BeeFS
  • Demonstrate experience in programming system maintenance tasks in C, Java, Perl, batch/shell, or another general-purpose programming language.
  • Knowledge of NUMA and understanding of NUMA related APIs.
  • Be able to perform complex performance analysis including system processes, I/O subsystems, networks and other related components.
  • Must have experience with multi-threading and parallel processing tools and environments.
  • Must have experience as a systems administrator. Must have advanced ability to analyze complex IT systems.
  • Experience with high-performance servers and associated high-performance networks.
  • Experience installing and maintaining clustered environments, including automated installation methods.
  • Knowledge of common server hardware architectures including servers (CPU, bus, memory), SANS, disk arrays, network hardware.
  • Understanding of Red Hat Linux Operating system including processes, files, memory management and I/O systems; networking services and protocols (e.g., TCP/IP, SSL, FTP, Telnet, LDAP).
  • Understanding of IP networking, basic routing, TCP ports and network services, including SSH, LDAP, SFTP and HTTP(S). Ability to design, promote, and implement change control and configuration management, patch management, high availability systems, structured design and support methodologies.
  • Must be organized with a strong ability to deliver tasks on time, manage multiple efforts and be able to work with minimal supervision.
Demonstrated ability to proactively learn, adapt to and use new hardware/software technologies.

  • ID: #49273617
  • State: Texas Austin 73301 Austin USA
  • City: Austin
  • Salary: Depends on Experience
  • Job type: Contract
  • Showed: 2023-02-18
  • Deadline: 2023-04-15
  • Category: Et cetera