Senior Site Reliability

2 meses atrás


São Paulo, Brasil SkySys Tempo inteiro
Role: Senior Site Reliability Engineer
Position Type: Full-Time Contract (40hrs/week)
Contract Duration: 6-8 Months+
Work Hours: Eastern Standard Time (EST)
Work Schedule: 8 hours/day (Mon-Fri)
Location: 100% Remote in Brazil

Overview:

The Site Reliability Engineer (SRE) plays a critical role in ensuring the reliability, scalability, and performance of Client's digital platforms and infrastructure. As part of a global team of highly skilled engineers, the SRE will work on challenging and impactful projects that directly contribute to the company's core business activities. Client is committed to fostering a culture of innovation, collaboration, and continuous learning, providing the SRE with an opportunity to grow and develop their skills while making a positive impact on the world.

Main Accountabilities: Troubleshoot and resolve infrastructure issues and incidents in a timely manner. Design, implement, and maintain reliable and scalable infrastructure solutions to support Client's digital platforms and applications. Monitor and analyze system performance, identify potential issues, and take proactive measures to prevent outages and disruptions. Collaborate with cross-functional teams, including software engineers, product managers, and operations personnel, to ensure seamless integration of infrastructure and application components. Develop and implement automation scripts and tools to streamline infrastructure management tasks and improve operational efficiency. Stay up to date with industry best practices and emerging technologies in the field of site reliability engineering. Close cooperation with DevOps and Cloud engineers. Impact/Dimensions: Contributes to the reliability and uptime of Client's digital platforms, which are critical for the company's global operations and customer satisfaction. Works on projects that have a direct impact on Client's revenue and profitability. The individual in this role will have a significant impact on the efficiency and effectiveness of Client's technology operations and will be responsible for driving continuous improvement initiatives that save the company time and money. Key Performance Indicators (KPIs): Mean Time to Repair (MTTR) for critical systems System uptime and availability Number of incidents and outages prevented Customer satisfaction with infrastructure performance Major Opportunities and Decisions: Identifying and mitigating potential risks to infrastructure stability and performance. Making decisions on infrastructure investments and resource allocation to optimize cost-effectiveness and scalability. Balancing the need for innovation with the requirement for stability and reliability in infrastructure operations. Management/Leadership: Leads and mentors a team of junior SREs and infrastructure engineers. Provides technical guidance to cross-functional teams on infrastructure-related matters. Actively participates in shaping the company's infrastructure strategy and roadmap. Key Relationships, Stakeholders & Interfaces (External & Internal): Works closely with software engineering teams to ensure seamless integration of infrastructure and application components. Development teams Infrastructure teams Business stakeholders Vendors and partners Knowledge and Technical Competencies: Strong understanding of SRE & DevOps principles and practices. Experience with CI/CD Azure DevOps platform. Knowledge of infrastructure management tools such as Ansible, Puppet, or Chef. Solid experience with containerization such as Docker and orchestration tools such as Kubernetes. Solid knowledge about security aspects in cloud and on-premises. Proficient in scripting languages such as Python or Bash. Experience with cloud computing platforms such as AWS and Azure where GCP is preferred. Experience with monitoring software such as Datadog, Zabbix, Kibana etc. Hand-on coding, deploying, and supporting large scale, serverless architectures. Infrastructure provisioning with Terraform or CloudFormation (IaaC). Experience with Linux and Windows operating systems. Strong problem-solving and analytical skills. Excellent communication and interpersonal skills. Education/Experience: Bachelor's degree in computer science or a related field. 5+ years of experience in DevOps engineering. Experience with leading teams and managing projects. Very good knowledge of English in general.

  • São Paulo, São Paulo, Brasil Newsela Tempo inteiro

    Newsela is seeking a skilled Senior-Level Site Reliability Engineering Services professional to join our team based out of Brazil or Argentina.Key Responsibilities:Participate in an on-call rotation to respond to incidents impacting Newsela.Com availability and provide support to developers during internal and external incidents.Maintain and extend our...


  • São Paulo, Brasil Winnin Tempo inteiro

    Somos a locadora do futuro e aceleramos em oferecer uma experiência incrível, 100% digital, sustentada pelo uso sem freios de tecnologia e amor aos nossos Turbilovers (sim! Nós abrimos carros pelo app, isso é sobre tecnologia).Estamos em busca de uma pessoa para a posição de Site Reliability Engineer.Os desafios que irá encontrar na sua corrida...


  • São Paulo, Brasil Ebury Tempo inteiro

    Site Reliability Engineer - Senior Platform Engineer at Ebury, São PauloEbury is a hyper-growth FinTech firm, named in 2021 as one of the top 15 European Fintechs to work for by AltFi. We offer a range of products including FX risk management, trade finance, currency accounts, international payments, and API integration.SRESão Paulo Office - Hybrid: 4 days...


  • São Paulo, Brasil Turbi Tempo inteiro

    E aí, tudo azul?Somos a locadora do futuro e aceleramos em oferecer uma experiência incrível, 100% digital, sustentada pelo uso sem freios de tecnologia e amor aos nossos Turbilovers (sim! Nós abrimos carros pelo app, isso é sobre tecnologia).Estamos em busca de uma pessoa para a posição de Site Reliability Engineer.Os desafios que irá encontrar na...


  • SAO PAULO, Brasil Grupo Hub Tempo inteiro

    About the Role Our company employs a diverse array of systems and technologies to deliver our products. As a Senior Site Reliability Engineer, you will work closely with software engineering teams to focus on software development and infrastructure design, providing expertise in performance, stability, and scalability. You will report to the Site Reliability...

  • Site Reliability Engineer

    2 semanas atrás


    São Paulo, Brasil AMARIS GROUP SA Tempo inteiro

    Procuramos por consultores dinâmicos para aumentar nossa equipe de Sistemas de Informação e Digital em São Paulo . Sua experiência, conhecimento e compromisso nos ajudarão a enfrentar os desafios de nossos clientes.Você apoiará diferentes projetos através de sua experiência como Site Reliability Engineer .Suas principais...


  • São Paulo, Brasil Clínica Da Cidade Tempo inteiro

    Senior Site Reliability Engineer (Relocation to Portugal)Localização: São PauloEmpresa: TeyaImportant: this role will be based in our Tech Hub in Porto, Portugal, and we will provide relocation support & VISA sponsorship.To ensure collaboration across all offices, interviews and work at Teya is done in English, so please consider this when applying.About...

  • Site Reliability Engineer

    4 semanas atrás


    São Paulo, Brasil UNIKRH Tempo inteiro

    ResponsabilidadesAssegurar a alta disponibilidade e resiliência dos sistemas através de práticas de SRE (Site Reliability Engineering).Monitorar e otimizar o desempenho dos sistemas para atender às demandas de escalabilidade e eficiência.Automatizar tarefas operacionais repetitivas, implementando scripts e ferramentas para melhorar a eficiência e...


  • São Paulo, Brasil isaac Tempo inteiro

    Bora fazer parte de um time brilhante em uma Fintech que veio pra transformar o futuro da Educação no Brasil?No isaac, operamos com um time enxuto de Site Reliability Engineering (SRE) que lidera nossa estratégia de Platform Engineering, priorizando a autonomia das equipes de desenvolvimento. Estamos focados em alavancar a produtividade das pessoas e a...


  • São Paulo, São Paulo, Brasil isaac Tempo inteiro

    Desenvolva sua carreira em uma Fintech inovadora!Estamos procurando por um Engenheiro de Confiabilidade de Site Sênior para integrar nosso time de Site Reliability Engineering (SRE) em uma plataforma de Educação no Brasil.O time de SRE é responsável por liderar nossa estratégia de Platform Engineering, priorizando a autonomia das equipes de...


  • São Paulo, São Paulo, Brasil SkySys Tempo inteiro

    Job DescriptionAt SkySys, we are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will play a critical role in ensuring the reliability, scalability, and performance of our digital platforms and infrastructure.Main Responsibilities:Troubleshoot and resolve infrastructure issues and...


  • São Paulo, São Paulo, Brasil isaac Tempo inteiro

    O isaac é uma fintech de educação que busca construir soluções inovadoras para a gestão financeira escolar.Nossa missão é apoiada por investidores de grande porte, como SoftBank e General Atlantic.Como Senior Site Reliability Engineer, você protagonizará iniciativas de automação de processos de infra/engenharia e trabalhará com diversas áreas...


  • São Paulo, Brasil NinjaOne, LLC Tempo inteiro

    Titulo da Vaga: Site Reliability EngineerNome da Empresa: NinjaOne, LLCSalário:Localização: São Paulo – SPDescrição da Vaga:About the RoleAt NinjaOne we are passionate about building unified IT solutions that simplify the way IT organizations work. We are currently looking for a Site Reliability Engineer to join our SRE team in the Platform...


  • São Paulo, Brasil https:www.energyjobline.comsitemap.xml Tempo inteiro

    Job DescriptionYour MissionProvide self-service cloud-native products for delivery teams while matching business requirements such as security, compliance, cost, and reliability.As a Senior SRE, you will:Take part in the design, development, deployment, and management of infrastructure productsEvangelize the best practices around observability, reliability,...

  • Site Reliability Engineer

    4 semanas atrás


    São Paulo, Brasil UNIKRH Tempo inteiro

    ResponsabilidadesAssegurar a alta disponibilidade e resiliência dos sistemas através de práticas de SRE (Site Reliability Engineering).Monitorar e otimizar o desempenho dos sistemas para atender às demandas de escalabilidade e eficiência.Automatizar tarefas operacionais repetitivas , implementando scripts e ferramentas para melhorar a eficiência e...


  • São Paulo, Brasil Newsela Tempo inteiro

    Seeking to hire a Contractor based out of Brazil or Argentina for Senior-Level Site Reliability Engineering Services.Scope of Services:Be on an on-call rotation to respond to incidents that impact Newsela.Com availability and provide support for developers during internal and external incidentsMaintain and assist in extending our infrastructure with...


  • São Paulo, São Paulo, Brasil isaac Tempo inteiro

    Desenvolva sua carreira como Engenheiro de Confiabilidade de Site Sênior em isaacEstamos procurando por um Engenheiro de Confiabilidade de Site Sênior para se juntar a nossa equipe de Site Reliability Engineering (SRE) em isaac. Nossa equipe é responsável por garantir a confiabilidade e a escalabilidade de nossos serviços em nuvem.ResponsabilidadesCriar...


  • São Paulo, Brasil Compass.UOL Tempo inteiro

    Job descriptionWe are looking for a Site Reliability Engineer (SRE) who will be responsible for applying best practices and global standards, aligning with the SRE culture.Main responsibilitiesAplicar melhores práticas e padrões globais, alinhando-se à cultura SRE; Desenvolver soluções e ferramentas de automação para melhorar a usabilidade do software...

  • Site Reliability Engineer

    4 semanas atrás


    São Paulo, Brasil ENGINEERINGUK Tempo inteiro

    You will need to login before you can apply for a job.Do you have programming, cloud infrastructure, and containerization knowledge?Would you like to join our great reliability engineering team?About the RoleThis entry-level Site Reliability Engineer will ensure the reliability, availability, and performance of company systems, and will collaborate with...

  • Site Reliability Engineer

    3 semanas atrás


    São Paulo, Brasil LexisNexis Risk Solutions Tempo inteiro

    Do you have programming, cloud infrastructure, and containerization knowledge?Would you like to join our great reliability engineering team?About the RoleThis entry-level Site Reliability Engineer will ensure the reliability, availability, and performance of company systems, and will collaborate with teams to solve issues, automate processes, and enhance...