Senior HPC Cluster Support Engineer

2 weeks ago


São Paulo, Brazil Sky Systems, Inc. Full-time

Role: HPC Cluster Support – CIBA 4 (Senior)
Position Type: Part-Time Contract (20 hrs/week)
Contract Duration: 6 months
Work Hours: EST or PST
Location: 100% Remote

We're seeking a Senior HPC Cluster Support Engineer to maintain and support large-scale production HPC environments running Bright Cluster Manager and Slurm. This role focuses on cluster operations, hardware troubleshooting, user support, and vendor coordination to ensure uninterrupted high-performance computing workloads.

Key Responsibilities

  • Manage and support HPC clusters: job submission issues, queue management, and user troubleshooting
  • Monitor cluster health and resolve node failures, networking issues, and domain problems
  • Diagnose hardware faults (GPUs, boards, power, nodes) and perform remote checks using BMC tools (Dell iDRAC, HPE iLOM, Supermicro)
  • Troubleshoot InfiniBand, Panasas storage, and network integration issues
  • Coordinate repairs and escalate with vendors (Park Place, VDura)
  • Apply system updates, patches, and configurations
  • Collaborate with users and provide regular status updates

Required Skills

  • Strong experience with Bright Cluster Manager and Slurm
  • Linux systems administration and advanced troubleshooting
  • Hardware diagnostics and BMC remote management tools
  • Experience with InfiniBand, HPC storage systems (Panasas), and vendor escalation
  • Active Directory integration for Linux is a plus



  • HPC Software Engineer

    3 weeks ago


    São Paulo, Brazil Canonical Full-time

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough...


  • São Paulo, Brazil beBeeSupport Full-time

    Senior HPC Cluster Support Engineer. This role involves providing high-level technical support for large-scale production HPC environments running Bright Cluster Manager and Slurm. Main responsibilities include cluster administration, troubleshooting user issues, monitoring cluster health, and coordinating with vendors to resolve hardware and software...


  • São Paulo, Brazil Dev.Pro Full-time

    Senior Site Reliability Engineer - OPS00023. We are a US-based outsource software development company that has been delivering exceptional software experience to our clients since 2011, helping technology companies become industry leaders. Over the past few years, we've been hiring specialists all over the world while our main development centers were...


  • São Paulo, Brazil Oracle Full-time

    About the Role: As a Principal Cloud Architect, you will be at the forefront of designing and implementing next-generation accelerated computing and AI solutions on Oracle Cloud Infrastructure (OCI). You will engage directly with startup to strategic customers, helping them architect and deploy complex HPC and GPU clusters, AI platforms, and intelligent...


  • São Paulo, Brazil Lenovo Full-time

    Description and Requirements: We are Lenovo! We're a leader in genuine innovation, dreaming up – and building – the technology and services that enable and inspire progress around the world. Our innovative high-quality PCs & Smart Devices, Data Centers, Mobile and Smart Office products are designed and built with the customer in mind. And it's our...


  • São Paulo, São Paulo, Brazil Dev Full-time

    We are a US-based outsource software development company that has been delivering exceptional software experience to our clients since 2011, helping technology companies become industry leaders. Over the past few years, we've been hiring specialists all over the world while our main development centers were in Ukraine. Now, we keep expanding and start...


  • São Paulo, Brazil Workana Full-time

    Full-time | Remote | CST Coverage. We're hiring a Level 3 Engineer to provide advanced technical support and infrastructure expertise for our US-based MSP clients. What You'll Do: Handle escalations from Level 2 engineers. Perform advanced Azure tasks: resource setup, policies, security, scripting, automation. Support enterprise networking: firewalls,...


  • São Paulo, Brazil Kaspersky Full-time

    Kaspersky has been protecting individuals and corporate clients all over the world from cyber threats for 27 years. We have 400 million unique users, 270 000 corporate clients, 517 products, 1100 technological patents and 34 offices around the world. Today our team has more than 5 000 top level experts, all of...