Site Reliability Engineering

1 semana atrás


São Paulo, São Paulo, Brasil Wildlife Studios Tempo inteiro

We're looking for a talented and passionate Site Reliability Engineering Manager, to join Wildlife's Cloud Platform team.

As an SRE Manager you will have the goal to provide easy-to-use, highly available systems to all the engineers in the company. As an SRE Manager of the Compute team, your main goal is to enable your team to improve the infrastructure services, using and refining our existing automations while being able to contribute in technical and business decisions for new services that will support the scalability and usability of the infrastructure services in the company and improving the team career growth, engagement and retention.

We know that the work we do has a high impact on our company's success and culture. The right person for this position is curious by nature, proactive, loves solving problems, and can thrive in a fast and growing business.

What you'll do
  • Be the manager of a cross-functional team, contributing to the team roadmap and growth of its individual contributors.
  • Develop, maintain, and optimize infrastructure clusters (Kubernetes, NATS, ETCD, Postgres, MongoDB, Redis, Elasticsearch) and our APIs and Automations to manage them (Kubernetes Operators, Infrastructure as code, Pipelines, CLIs).
  • Analyze costs of infrastructure services and help define and optimize the budget of our infrastructure and game teams.
  • Contribute to improvements on monitoring and observability patterns for infrastructure services.
  • Troubleshoot, manage and lead incidents in production.
  • Automate and improve infrastructure provisioning, by augmenting our Infrastructure as Code or implementing new features and infrastructure services in our internal tools.
  • Help partner teams to architect and scale their applications and infrastructure with cloud-native best practices.
What you'll need

We expect our Managers to be Technical, dedicating around 50% of their time to working together with the ICs in their day-to-day work and being an active voice and participative on the team technical roadmap.

  • Experience managing small teams with infrastructure background.
  • Some level of leadership skills, including the areas of people management, communications, project management, talent development, performance management, team effectiveness, agility, hiring, decision making, planning, budgeting, and collaboration.
  • Coding experience in at least one programming language. We work mostly with Go and Python.
  • University degree in courses related to computing such as Computer Engineering, Computer Science, Information Systems, and Systems Analysis and Development or equivalent Market Experience.
  • Solid understanding of computer concepts (operational systems, networking, concurrency, memory management, and algorithm analysis).
  • Experience with cloud computing services such as Amazon AWS, Google Cloud, or Microsoft Azure.
  • Experience with Infrastructure as Code automations, such as Terraform, Packer, Ansible, Crossplane, etc.
  • Experience managing Kubernetes clusters and developing Kubernetes operators.
  • Experience automating routine tasks, such as deployments and monitoring setup.
  • Experience with incident management and being oncall for productive systems and workloads.
  • Strong written and spoken communication skills in English.
  • Experience with complex, large-scale, and high-available systems.
  • Experience with monitoring and telemetry in applications and infrastructure.
  • History of technical leadership and ownership of critical projects, including the mentoring of junior team members.
More about you
  • Player focused. We are player-oriented and infrastructure has a great impact on their experience. You have empathy with our players and focus on ensuring they have an amazing experience.
  • Automation is key to scaling. We look for engineers who have a history of projecting and executing automation projects in order to get rid of any manual and repetitive tasks.
  • Calm and pragmatism. When everything seems to be falling apart around you, you have a plan and keep calm.
  • Bleeding edge. You are curious and like to study new technologies, test new solutions, and measure the impact brought by changes.
  • Metrics oriented. We make decisions based on data and metrics. We measure the results of our tasks against the expected outcome.
  • Bar raiser. You want to elevate your team skills and raise the bar, by mentoring your peers, spreading knowledge, being proactive and a tech lead.
About Wildlife

Wildlife is one of the leading mobile game developers and publishers in the world. We have released more than 60 titles, reaching billions of people around the globe. Here, we create games that will excite, intrigue, and engage our players for years to come

Wildlife is proud to be an Equal Opportunity and Affirmative Action employer. We do not discriminate based upon race, color, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local law.

We're committed to providing accommodations for candidates with disabilities in our recruiting process.

Apply now #J-18808-Ljbffr
  • Site Reliability Engineering

    4 semanas atrás


    São Paulo, São Paulo, Brasil Veriff Tempo inteiro

    Site Reliability Engineering (SRE) CoachThe Engineering team builds the software powering Veriff. We're on a mission to foster operational excellence and reliability through the adoption of Site Reliability Engineering (SRE) principles. It's a dynamic and fast-paced environment filled with challenges and opportunities. To continue innovating and raising the...

  • Site Reliability Engineering

    2 semanas atrás


    São Paulo, São Paulo, Brasil Veriff Tempo inteiro

    Site Reliability Engineering (SRE) Coach The Engineering team builds the software powering Veriff. We're on a mission to foster operational excellence and reliability through the adoption of Site Reliability Engineering (SRE) principles. It's a dynamic and fast-paced environment filled with challenges and opportunities. To continue innovating and raising...


  • São Paulo, São Paulo, Brasil Amazon Tempo inteiro

    Maintenance Engineering Planner, Amazon Reliability Maintenance Engineering - IntlRMEWe are looking for motivated, customer-focused people who want to join our team as a Reliability Maintenance Engineering Planner. The Reliability Maintenance Engineering Planner is responsible for asset and spares management, preventative maintenance planning, and machine...


  • São Paulo, São Paulo, Brasil Veriff Tempo inteiro

    The Engineering team builds the software powering Veriff.We're on a mission to foster operational excellence and reliability through the adoption of Site Reliability Engineering (SRE) principles.It's a dynamic and fast-paced environment filled with challenges and opportunities.To continue innovating and raising the bar for system resilience, we're seeking...


  • São Paulo, São Paulo, Brasil Wildlife Studios Tempo inteiro

    We're looking for a talented and passionate Site Reliability Engineering Manager, to join Wildlife's Cloud Platform team.As an SRE Manager you will have the goal to provide easy-to-use, highly available systems to all the engineers in the company. As an SRE Manager of the Compute team, your main goal is to enable your team to improve the infrastructure...

  • Site Reliability Engineering

    3 semanas atrás


    São Paulo, São Paulo, Brasil Veriff Tempo inteiro

    The Engineering team builds the software powering Veriff. We're on a mission to foster operational excellence and reliability through the adoption of Site Reliability Engineering (SRE) principles. It's a dynamic and fast-paced environment filled with challenges and opportunities. To continue innovating and raising the bar for system resilience, we're seeking...

  • Site Reliability Engineering

    3 semanas atrás


    São Paulo, São Paulo, Brasil Veriff Tempo inteiro

    The Engineering team builds the software powering Veriff. We're on a mission to foster operational excellence and reliability through the adoption of Site Reliability Engineering (SRE) principles. It's a dynamic and fast-paced environment filled with challenges and opportunities. To continue innovating and raising the bar for system resilience, we're seeking...


  • São Paulo, São Paulo, Brasil Cwi Software Tempo inteiro

    Nessa posição de SRE (Site Reliability Engineering) você irá atuar em equipe multidisciplinar, participando efetivamente do desenho da solução e definições de arquitetura da solução.Você irá atuar como responsável pela administração de ambientes kubernetes, criação e gestão de pipelines, criação de métricas e dashboards, automação de...


  • São Paulo, São Paulo, Brasil Wildlife Studios Tempo inteiro

    About the RoleThe Site Reliability Engineering Manager will be responsible for overseeing the design, implementation, and maintenance of our cloud-based infrastructure services. This includes managing a team of engineers who develop, maintain, and optimize these services.Responsibilities:Develop and execute strategies for improving infrastructure scalability...


  • São Paulo, São Paulo, Brasil Internetwork Expert Tempo inteiro

    Intuition Machines uses AI/ML to build enterprise security products. We apply our research to systems that serve hundreds of millions of people, with a team distributed around the world. You are probably familiar with our best-known product, the hCaptcha security suite. Our approach is simple: low overhead, small teams, and rapid iteration.As a Site...


  • São Paulo, São Paulo, Brasil Wildlife Studios Tempo inteiro

    Our VisionAt Wildlife Studios, we create games that excite, intrigue, and engage our players for years to come. We're committed to providing an amazing experience for our players, and our infrastructure has a significant impact on this.We're looking for a seasoned Site Reliability Engineering Manager who shares our passion for innovation and customer...


  • São Paulo, São Paulo, Brasil Loadsmart Inc Tempo inteiro

    ARE YOU INTERESTED IN JOINING A HYPER-GROWTH LOGISTICS TECH COMPANY?Loadsmart is a growth-stage start-up technology company valued at over $1 billion (a true Tech Unicorn)We are looking for a talented Sr. Site Reliability Engineer to join our teamIn this role, you will build and maintain the company's internal platform, driving operational excellence and...


  • São Paulo, São Paulo, Brasil Veriff Tempo inteiro

    Company OverviewWe are a leading identity verification platform partner for innovative growth-driven organizations, committed to fostering operational excellence and reliability through the adoption of Site Reliability Engineering principles. Our mission is to protect honest people online by promoting system resilience and continuous improvement.Job...


  • São Paulo, São Paulo, Brasil Wildlife Studios Tempo inteiro

    Our Team">We're a team of passionate professionals dedicated to delivering exceptional gaming experiences. Our Cloud Platform team is responsible for designing, building, and maintaining the infrastructure that powers our games. As a Site Reliability Engineering Manager, you'll play a crucial role in ensuring our systems are reliable, scalable, and...


  • São Paulo, São Paulo, Brasil buscojobs Brasil Tempo inteiro

    Infrastructure & Technology Site Reliability Engineering ProfessionalLocation: Hortolandia, BRSalary: BRL 80.000 - 120.000Responsibilities:Provide technical support using problem determination and problem source.Utilize technical and negotiation skills in collaboration with other support operations/organizations to prioritize and diagnose problems to...


  • São Paulo, São Paulo, Brasil Amazon Tempo inteiro

    Maintenance Engineering Planner, Amazon Reliability Maintenance Engineering - IntlRMEWe are looking for motivated, customer-focused people who want to join our team as a Reliability Maintenance Engineering Planner. The Reliability Maintenance Engineering Planner is responsible for asset and spares management, preventative maintenance planning, and machine...


  • São Paulo, São Paulo, Brasil Givaudan Tempo inteiro

    Step into our world of creativity and joySite Engineering Manager- Join us and celebrate the beauty of human experience.Create for happier, healthier lives, with love for nature.Together, with kindness, humility, and a spirit of adventure, we deliver food innovations, craft inspired fragrances and develop beauty and wellbeing solutions.There's much to learn...

  • Site Reliability Engineer

    4 semanas atrás


    São Paulo, São Paulo, Brasil GO.K - One Step Ahead Tempo inteiro

    Sobre o desafio Estamos em busca de um Site Reliability Engineer (SRE) Sênior com ênfase em Infraestrutura para atuar na construção, disponibilidade, escalabilidade e eficiência da nossa infraestrutura.O profissional será responsável pela concepção, automação de processos, otimização de sistemas distribuídos, implementação de observabilidade...

  • Site Reliability Engineer

    1 semana atrás


    São Paulo, São Paulo, Brasil OneBox Tempo inteiro

    Venha ser nosso SRE e garanta a confiança dos nossos sistemasEstamos em busca de um Site Reliability Engineer (SRE) para se juntar ao nosso time e assegurar que nossos serviços e plataformas estejam sempre disponíveis, escaláveis e resilientes. Se você é apaixonado por tecnologia, adora automatizar processos e tem experiência em ambientes de alta...

  • Site Reliability Engineer

    4 semanas atrás


    São Paulo, São Paulo, Brasil Capgemini Tempo inteiro

    Nossa Oportunidade Estamos em busca de um Site Reliability Engineer | SRE Pleno - Azure Cloud. O candidato(a) ideal deve ter perfil hands-on em resolução de incidentes, conhecimento em Cloud Azure e Dynatrace. Se você é uma pessoa talentosa e apaixonada por tecnologia que procura uma função dinâmica, então somos o lugar para você. Nossa oportunidade...