Lead Site Reliability Engineer
Há 1 mês
This is a fully remote opportunity for one of our esteemed clients.
About the role:We are seeking a highly skilled and experienced
Lead SRE
to oversee the deployment, maintenance, and optimization of the DataDog observability platform across our R&D environment. This role is crucial for ensuring a unified, efficient, and secure monitoring infrastructure. You will lead API integrations, assist in platform modernization, and support teams with architectural insights and best practices for observability and monitoring.
Responsibilities
Oversee the deployment, maintenance, and configuration of DataDog for system monitoring, logging, and observability.Act as the primary point of contact for technical issues related to DataDog and observability tools.Lead API integrations and enhance platform capabilities to align with organizational needs.Monitor system performance and health, implementing proactive measures to prevent disruptions.Assist with the migration to service accounts and ensure best practices for user and key management.Provide operational and training support to R&D teams, ensuring efficient use of observability tools.Contribute to platform improvements and guide the adoption of OpenTelemetry or other modernization initiatives.
Required skills:
DataDog Expertise
(7-9 years): Advanced hands-on experience with DataDog, including monitoring, logging, dashboard creation, and APM configuration.Observability Tools
(7-9 years): Proficiency with tools like Prometheus and Grafana for system performance tracking.Cloud Platforms
(7-9 years): Extensive experience with AWS, including integration with DataDog for unified monitoring.Containerization and Microservices Monitoring
(4-6 years): Expertise in monitoring Kubernetes and containerized environments.Python
(4-6 years): Proficiency in Python for scripting and automating monitoring tasks.CI/CD Pipelines
(4-6 years): Experience integrating observability tools like DataDog into CI/CD workflowsInstallation and configuration of DataDog agents and integrations.User management, including roles, permissions, and security best practices.Leadership skills
Nice-to-have skills:OpenTelemetry Adoption:
Experience migrating from proprietary tracing models to OpenTelemetry for distributed tracing.API & Platform Migration: Expertise in transitioning to service account models and consolidating access keys for enhanced security.Automation
: Familiarity with automating monitoring setups and API configurations using scripting tools.
Type of contract
: Contractor. You will be responsible for your taxes.Contract length:
3-months (minimum). Renewable contract.Dedication:
Full-time (40 hours/week)Location:
100% remoteTimezone
: You’ll need to overlap at least 6 hours with US PST (UTC-4).
At Andela, we outcompete through diversity. We know that our strengths lie in the multiplicity of talents, perspectives, backgrounds, and orientations of residents in our community and we take pride in that. Andela is committed to a work environment in which all individuals are treated with respect and dignity. Each individual has the right to work in a professional atmosphere that promotes equal employment opportunities and prohibits discriminatory practices. Andela provides equal employment opportunities and workplace to all employees and applicants without regard to factors including but not limited to race, color, religion, gender, sexual orientation, gender identity, national origin, age, disability, pregnancy (including breastfeeding), genetic information, HIV/AIDS or any other medical status, family or parental status, marital status, amnesty or status as a covered veteran in accordance with applicable federal, state and local laws. This commitment applies to all terms and conditions of employment, including but not limited to hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training. Our policies expressly prohibit harassment and/or discrimination as stated above.
Andela is home for all, come as you are.
-
Consultor(a) Site Reliability Engineer
3 semanas atrás
Brasília, Brasil Ródio Tech Soluções Tempo inteiroEstamos à procura de um(a) Consultor(a) Site Reliability Engineer – Pleno para se juntar ao nosso time de profissionais excepcionais na RÓDIO TECH.Responsabilidades:Desenvolver e manter sistemas resilientes utilizando linguagens de programação como Java, GoLang, Kotlin, Groovy ou Shell scripting.Trabalhar com ferramentas de contêinerização...
-
Site Reliability Engineering
4 meses atrás
Brasília, DF, Brasil MIRANTE TECNOLOGIA Tempo inteiro**Cargo: Site Reliability Engineering - SRE** **Tipo de contratação**: CLT **Modalidade: Híbrido em Braília** **Formação: Superior em TI** **Requisitos Obrigatórios/**Principais atividades**: - Sólida experiência como SRE - Orquestração de Contêineres: Kubernetes - Cloud (AWS, Azure, GCP) - Banco de Dados SQL e NoSQL - Brokers de Mensagens:...
-
Site Reliability Engineer
2 meses atrás
Brasília, Brasil Sinqia Tempo inteiroPara gente dar #match, precisamos que você conheça e tenha:Vivência com atividades de suporte a usuários de negócio e desenvolvimento. Ter atuado em atividades de implementação de aplicações .NET e Java.Atuação em projetos com alta complexidade de integração sistêmica e bons conhecimentos em arquitetura de aplicação.Experiencia na...
-
Senior Devops Engineer
2 meses atrás
Brasília, Brasil Vingcard Tempo inteiroTITLE:Senior DevOPS EngineerLOCATION/GEOGRAPHY:Brasília, BrazilReports To:R&D ManagerRole Summary:As a Senior DevOps Engineer, you will play a pivotal role in managing and optimizing cloud infrastructure, ensuring the seamless integration of software development tools, and maintaining the security and reliability of
-
Technical Engineer Lead
4 semanas atrás
Brasília, Brasil Launchcode Tempo inteiroAbout Us:Launchcode is a cutting-edge technology company focused on revolutionizing the agricultural industry. Our innovative solutions leverage advanced software and IoT technologies to optimize operations and improve efficiency. We are currently seeking a skilled Technical Engineer Lead Full Stack to join our dynamic team.Important facts about this...
-
Platform Engineer
2 meses atrás
Brasília, Brasil Virtasant Tempo inteiroDo you want to work on cutting-edge projects with the world’s best IT engineers? Do you wish you could control which projects to work on and choose your own pay rate? Are you interested in the future of work and how the cloud will form teams? If so - this is the role for you.We are looking for an experiencedPlatform Engineerto join our team. This role...
-
Chief Engineers
Há 1 mês
Brasília, Brasil Svitzer Brasil Serviços Marítimos Ltda. Tempo inteiroHighest ranking engineer onboard, responsible for the safe operation, maintenance and reliability of the engine room and equipment onboard.How to Apply To apply for this role, please click on the 'Apply Now' button and create a Candidate Home to manage your applications.Diversity Statement In Svitzer we value the diversity of our talent and will always...
-
Data Engineer
Há 1 mês
Brasília, Brasil Insight Global Tempo inteiroMust-haves:5+ years of data engineer experienceExperience working with AWS architecture (RDS, Glue, EMR, EC2, S3, Postgres, EMR, etc..)Snowflake data warehousingPython & SQL codingDay to Day:Insight Global is looking for 3 remote data engineers to join the analytics organization at a global medical device client. We are establishing a new data analytics...
-
Data Engineer
Há 1 mês
Brasília, Brasil Insight Global Tempo inteiroMust-haves: 5+ years of data engineer experienceExperience working with AWS architecture (RDS, Glue, EMR, EC2, S3, Postgres, EMR, etc..)Snowflake data warehousingPython & SQL codingDay to Day:Insight Global is looking for 3 remote data engineers to join the analytics organization at a global medical device client. We are establishing a new data analytics...
-
Cloud Engineer CLT
3 semanas atrás
Brasília, Brasil INTEGRA RH Tempo inteiroCandidato deverá ser capaz de arquitetar soluções para o melhor uso dos recursos disponíveis nos provedores de nuvem visando disponibilidade, tolerância a falhas, desempenho, segurança e recuperação (Site Reliability Engineering - SRE), buscando sempre melhorar a relação custo-benefício das implementações.O Projeto utiliza algumas das últimas...
-
Senior / Lead Backend Engineer (Go, Golang)
5 meses atrás
Brasília, Brasil PRAGMATIKE Tempo inteiroJob Description: Location: Fully remote, EU timezone (CET +/- 2hours)Start date: ASAPLanguages: English is mandatory; French is a plusOur client: Cloud Computing / Blockchain / AI - European Saas Responsibilities: Design and develop scalable, distributed, server-side software applications and microservicesCollaborate within an Agile Scrum team to define and...
-
Cloud Engineer
3 semanas atrás
Brasília, Brasil CONVERGÊNCIA Tempo inteiroCandidato deverá ser capaz de arquitetar soluções para o melhor uso dos recursos disponíveis nos provedores de nuvem visando disponibilidade, tolerância a falhas, desempenho, segurança e recuperação (Site Reliability Engineering – SRE), buscando sempre melhorar a relação custo-benefício das implementações. O Projeto utiliza algumas das...
-
Sr Engineer/ Technical lead
4 semanas atrás
Brasília, Brasil Turing Tempo inteiroCompany Overview:We are a pioneering organization in the field of generating training data for all leading large language models to advance Artificial General Intelligence (AGI), esp in the domains of coding, advanced reasoning, planning, STEM, etc.. Our vision is to design the best systems to combine human knowledge and model capability into training data...
-
Azure Senior Data Engineer
Há 14 horas
Brasília, Brasil Ubique Systems Tempo inteiroAzure Senior Data Engineer who has strong experience in DWH and ETL technologies on Big Data and Azure cloud and experience in real time data processing. Candidates must have strong communication skills that allow you to regularly deal with stakeholders, as well as with specialists from related fields including POs, architects, and other Engineers.They...
-
AI Validation Engineer
3 semanas atrás
Brasília, Distrito Federal, Brasil Excellent Opportunity Tempo inteiroAI Validation EngineerWe are seeking a highly skilled AI Validation Engineer to join our team at Excellent Opportunity.Responsibilities:Review code and solutions generated by AI systems to ensure adherence to quality standards and best practices.Organize the development cycle, manage project priorities, and set goals and deadlines.Utilize expertise in Go...
-
Cloud Engineer
3 semanas atrás
Brasília, Brasil CONVERGÊNCIA Tempo inteiroCandidato deverá ser capaz de arquitetar soluções para o melhor uso dos recursos disponíveis nos provedores de nuvem visando disponibilidade, tolerância a falhas, desempenho, segurança e recuperação (Site Reliability Engineering – SRE), buscando sempre melhorar a relação custo-benefício das implementações.O Projeto utiliza algumas das últimas...
-
Principal Java Engineer
Há 1 mês
Brasília, Brasil Ranger Technical Resources Tempo inteiroPrincipal Software Engineer #2409Position Summary:Our partner, a fast-growing SaaS company that provides intuitive remote monitoring and endpoint management software for IT teams, is seeking a Principal Software Engineer to join their expanding Mainline team. In this pivotal role, you will be instrumental in the efficient operation and strategic evolution of...
-
Cloud Engineer
3 semanas atrás
Brasília, Brasil CONVERGÊNCIA Tempo inteiroCandidato deverá ser capaz de arquitetar soluções para o melhor uso dos recursos disponíveis nos provedores de nuvem visando disponibilidade, tolerância a falhas, desempenho, segurança e recuperação (Site Reliability Engineering - SRE), buscando sempre melhorar a relação custo-benefício das implementações. O Projeto utiliza algumas das...
-
Tech Lead
4 semanas atrás
Brasília, Brasil Konsi Tempo inteiroTech Lead - Analytics EngineerNível: Sênior / EspecialistaModalidade: CLT ou PJLocal de trabalho: RemotoDescrição da vagaEstamos buscando um(a) profissional experiente para se juntar à equipe e liderar os esforços de Dados, principalmente emAnalytics Engineering. A pessoa selecionada será referência técnica emmodelagem e entrega de dados para...
-
Cloud Engineer CLT
3 semanas atrás
Brasília, Brasil INTEGRA RH Tempo inteiroCandidato deverá ser capaz de arquitetar soluções para o melhor uso dos recursos disponíveis nos provedores de nuvem visando disponibilidade, tolerância a falhas, desempenho, segurança e recuperação (Site Reliability Engineering - SRE), buscando sempre melhorar a relação custo-benefício das implementações. O Projeto utiliza algumas das últimas...