Senior HPC Cluster Support Engineer

2 days ago


São Paulo, Brazil Sky Systems, Inc. (SkySys) Full-time

Role: HPC Cluster Support – CIBA 4 (Senior)
Position Type: Part-Time Contract (20 hrs/week)
Contract Duration: 6 months
Work Hours: EST or PST
Location: 100% Remote

We're seeking a Senior HPC Cluster Support Engineer to maintain and support large-scale production HPC environments running Bright Cluster Manager and Slurm. This role focuses on cluster operations, hardware troubleshooting, user support, and vendor coordination to ensure uninterrupted high-performance computing workloads.

Key Responsibilities

Manage and support HPC clusters: job submission issues, queue management, and user troubleshooting
Monitor cluster health and resolve node failures, networking issues, and domain problems
Diagnose hardware faults (GPUs, boards, power, nodes) and perform remote checks using BMC tools (Dell iDRAC, HPE iLOM, Supermicro)
Troubleshoot InfiniBand, Panasas storage, and network integration issues
Coordinate repairs and escalate with vendors (ParkPlace, VDura)
Apply system updates, patches, and configurations
Collaborate with users and provide regular status updates

Required Skills

Strong experience with Bright Cluster Manager and Slurm
Linux systems administration and advanced troubleshooting
Hardware diagnostics, BMC remote management tools
Experience with InfiniBand, HPC storage systems (Panasas), and vendor escalation
Active Directory integration for Linux is a plus


  • HPC Software Engineer

    2 weeks ago


    São Paulo, Brazil Canonical Full-time

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough...

  • HPC Software Engineer

    7 hours ago


    São Paulo, Brazil Canonical Full-time

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough...


  • São Paulo, Brazil beBeeSupport Full-time

    Senior HPC Cluster Support Engineer This role involves providing high-level technical support for large-scale production HPC environments running Bright Cluster Manager and Slurm. Main responsibilities include cluster administration, troubleshooting user issues, monitoring cluster health, and coordinating with vendors to resolve hardware and software...


  • São Paulo, SP, Brazil beBeeSupport Full-time

    Senior HPC Cluster Support Engineer This role involves providing high-level technical support for large-scale production HPC environments running Bright Cluster Manager and Slurm. Main responsibilities include cluster administration, troubleshooting user issues, monitoring cluster health, and coordinating with vendors to resolve hardware and software...

  • HPC System Administrator

    1 week ago


    São Paulo, Brazil Lenovo Full-time

    HPC System Administrator – Lenovo We are currently hiring for an HPC System Administrator to work onsite for a customer based in Rio de Janeiro, Brazil. Responsibilities Monitoring, maintaining, and managing the physical infrastructure of a data center, ensuring its smooth operation, reliability, and security. Monitoring power, cooling systems, and network...


  • São Paulo, Brazil Dev.Pro Full-time

    Senior Site Reliability Engineer - OPS00023 We are a US‑based outsource software development company that has been delivering exceptional software experience to our clients since 2011, helping technology companies to become industry leaders. Over the past few years, we’ve been hiring specialists all over the world while our main development centers were...


  • São Paulo, Brazil Dev PRO Full-time

    We are a US-based outsource software development company that has been delivering exceptional software experience to our clients since 2011, helping technology companies to become industry leaders. Over the past few years, we’ve been hiring specialists all over the world while our main development centers were in Ukraine. Now, we keep expanding and start...


  • São Paulo, Brazil Dev.Pro Full-time

    We are a US-based outsource software development company that has been delivering exceptional software experience to our clients since ****, helping technology companies to become industry leaders. Over the past few years, we've been hiring specialists all over the world while our main development centers were in Ukraine. Now, we keep expanding and start...


  • São Paulo, Brazil Dev.Pro Full-time

    We are a US-based outsource software development company that has been delivering exceptional software experience to our clients since 2011, helping technology companies to become industry leaders. Over the past few years, we’ve been hiring specialists all over the world while our main development centers were in Ukraine. Now, we keep expanding and start...


  • São Paulo, Brazil Oracle Full-time

    About the Role: As a Principal Cloud Architect, you will be at the forefront of designing and implementing next-generation accelerated computing and AI solutions on Oracle Cloud Infrastructure (OCI). You will engage directly with customers ranging from startups to strategic accounts, helping them architect and deploy complex HPC and GPU clusters, AI platforms, and intelligent...