Site Reliability Architect

6 days ago


Fortaleza, Ceará, Brazil beBeeInfrastructure Full-time US$90,000 - US$120,000

Reliable System Engineer

About the Role

We are seeking a seasoned Reliable System Engineer to join our team. As a key member, you will be responsible for designing, implementing, and maintaining highly available and scalable systems.

Key Responsibilities:

  • Develop infrastructure architecture, automation, and intelligent monitoring systems from design through implementation.
  • Operate, maintain, and administer solutions that contribute to the operational efficiency, availability, and visibility of customer infrastructure.
  • Plan maintenance activities, design documentation, and standard procedures.
  • Provide Root Cause Analysis reports for outages/incidents (ITIL - Problem Management).
  • Observe and provide feedback on the current state of the client's infrastructure, and identify opportunities to improve resiliency, reduce incident occurrence, and automate repetitive administrative and operational tasks.
  • Contribute to, improve, and maintain team documentation about client systems and infrastructure, procedures, policies, and schedules.
  • Collaborate with teammates to contribute to the continuous improvement of our working culture.

Requirements:

  • Experience working with Google Cloud and AWS (including infrastructure-as-code deployment with CloudFormation, Terraform, OpsWorks, etc.)
  • Scripting and automation of administrative tasks using Python and Scala is a must
  • Solid understanding of microservices architecture and container technologies (Kubernetes is a must; Docker, LXC, etc.)
  • Clear understanding of software development lifecycles and best practices from an infrastructure point of view (PRs, merge, rebase, etc.)
  • Understanding of the end-to-end operation of a 'Business System' versus its individual components
  • Comprehensive systems hardware and network troubleshooting experience
  • Common Linux distribution platform installation, configuration, performance tuning, and cloud migration.
  • TCP/IP networking, NIC bonding, and network services configuration (DNS, NTP, DHCP, SMTP, etc.)
  • Operation and administration of virtual infrastructure, including experience with at least one hypervisor (VMware, Hyper-V, KVM, etc.)
  • Ability to describe IaaS, PaaS, SaaS, pros and cons of each, use cases for virtualization and cloud
  • Administration of web servers and supporting technologies, including network load balancers
  • Experience with the design, development, and deployment of Puppet
  • System and application error investigation, troubleshooting of access/availability issues including deep multi-system root cause analysis
  • Experience managing networking devices, such as switches and firewalls from a variety of vendors
  • Solid understanding of DevOps tools, processes, and culture
  • Ability to pick up new technologies quickly
  • Ability to provide accurate work scheduling and task estimations for work delivery

What We Offer

Attractive total rewards package; blog during work hours; take a day off and volunteer for your favorite charity.

Flexible remote work arrangement with no daily travel to an office. All you need is a stable internet connection.

Collaborate with some of the best and brightest in the industry.

Substantial training allowance; participate in professional development days, attend training, become certified, whatever you like.

Annual budget to personalize your work environment.

Annual wellness budget to make yourself a priority (use it on gym memberships, massages, fitness and more). Additionally, generous paid vacation and sick days, as well as a day off to volunteer for your favorite charity.

Important Details

Seniority level: Mid-Senior level.

Employment type: Full-time.

Job function: Information Technology.

Industries: IT Services and IT Consulting.



  • Fortaleza, Ceará, Brazil beBeeExpert Full-time US$120,000 - US$160,000

    Job Overview: Site Reliability Engineer - Remote Opportunity. One available position for the following skill set: Why This Role: This is a unique chance to join a dynamic team of experts in strategic database and analytics services, driving digital transformation and operational excellence. We empower organizations to leverage advanced technologies, including...

  • Site Reliability Engineer

    4 weeks ago


    Fortaleza, Ceará, Brazil BairesDev Full-time

    Site Reliability Engineer - Remote Work: At BairesDev, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely on roles that drive significant impact...

  • Site Reliability Engineer

    2 weeks ago


    Fortaleza, Ceará, Brazil BairesDev Full-time

    Site Reliability Engineer - Remote Work: At BairesDev, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely on roles that drive significant impact...

  • Site Reliability Engineer

    3 weeks ago


    Fortaleza, Ceará, Brazil Personetics Full-time

    About the company: Personetics is shaping the Cognitive Banking era, harnessing AI to help banks anticipate customer needs, provide actionable insights, and deliver intelligent financial guidance. Our platform continuously analyzes and leverages real-time transactional data, enabling banks to proactively support customers in managing their finances and...


  • Fortaleza, Ceará, Brazil Pythian Full-time

    Overview: Linux Site Reliability Consultant - Brazil | Remote | Work from Home. One available position for the following time zone: PST. Why Pythian: At Pythian, we are experts in strategic database and analytics services, driving digital transformation and operational excellence. Pythian, a multinational company, was founded in 1997 and started by ensuring the...




  • Fortaleza, Ceará, Brazil beBeeReliability Full-time R$100,800 - R$128,000

    System Reliability Specialist: The role of the System Reliability Specialist is to oversee the continuous delivery and setup of cloud-based systems, ensuring seamless operations and high availability. Key Responsibilities: Manage the configuration and deployment of cloud-based systems using modern tools and technologies. Provide timely responses to system...


  • Fortaleza, Ceará, Brazil beBeeDevops Full-time R$80,000 - R$140,000

    Job Overview: Your primary responsibility as a Site Reliability Engineer will be to ensure the seamless operation of our systems and services. This involves handling major incidents, performing in-depth application troubleshooting, and enhancing monitoring capabilities. Key Responsibilities: Handling critical incidents via CIRS (Critical Issue Response...


  • Fortaleza, Ceará, Brazil beBeeInfrastructure Full-time R$250,000 - R$300,000

    As a key member of our team, you will be responsible for owning and enhancing mission-critical infrastructure. You will ensure the performance, stability, and innovation of our systems by applying advanced administration, troubleshooting, and kernel/performance tuning skills in Linux environments. Operation, upgrades, and capacity planning for CloudStack and...


  • Fortaleza, Ceará, Brazil beBeeSoftware Full-time US$110,000 - US$140,000

    AgileEngine is an innovative company that creates award-winning software for top brands and startups. Key Responsibilities: Design automated invoice generation systems for various products. Build and scale attribution logic systems to support multiple chains and assets. Develop robust reporting capabilities for rewards and billing analytics. Create efficient...