Lead Operational Intelligence Developer

24 hours ago


Remote, Brazil EPAM Systems Full-time R$80,000 - R$120,000 per year

We are looking for a highly experienced and dynamic Lead Operational Intelligence Developer to join our team.

In this role, you will take ownership of leading the development, maintenance, and enhancement of our Elastic & Observability Platform deployed across GCP and Elastic Cloud. You will drive strategic initiatives, guide a high-performing technical team, and ensure platform reliability while fostering innovation and enabling self-service capabilities for platform consumers. This position also involves participating in an on-call rotation to oversee platform health and functionality.

Responsibilities

  • Oversee the availability, functionality, performance, and security of observability and search platforms to exceed business SLAs
  • Provide technical leadership during complex incidents and escalate resolutions promptly during on-call periods
  • Develop and maintain comprehensive platform documentation, standard operating procedures, and knowledge-sharing resources
  • Collaborate with cross-functional teams, stakeholders, and vendors to oversee operational requirements, drive strategic initiatives, and manage installations, troubleshooting, and upgrades
  • Lead the enhancement of platform features and self-service capabilities, including advanced Elastic Synthetics and chargeback automation
  • Architect and implement proof-of-concepts for platform innovation, such as AI-driven observability, advanced data processing models, or Kubernetes-based platform migration
  • Supervise the building, deployment, and maintenance of Elastic clusters using Infrastructure-as-Code (IaC) tools like Terraform and Ansible, while mentoring team members on best practices
  • Oversee platform lifecycle management activities, including component upgrades, capacity planning, cost optimization, and evolving compliance requirements
  • Continuously assess and fine-tune ELK stack performance, including ingestion, indexing, and query optimization for large-scale environments
  • Establish and enhance comprehensive alerting and incident management workflows, integrating sophisticated monitoring tools such as Kibana Rules, Watchers, and PagerDuty
  • Supervise the ingestion, enrichment, backup, and restoration of large-scale platform data while optimizing data workflows
  • Lead and plan critical operational events such as SSL certificate rotations, cluster migrations, or scalability optimization projects

Requirements

  • 5+ years of experience in Operational Intelligence, with a proven track record of leadership and technical expertise in managing large-scale observability platforms
  • Demonstrated ability to architect and manage Elastic clusters in complex, multi-cloud environments
  • In-depth knowledge of Elastic Stack components, including advanced configurations of Elasticsearch, Kibana, and Logstash
  • Advanced proficiency in Infrastructure-as-Code (IaC) tools like Terraform and Ansible, with demonstrated flexibility in adapting to other tools like Jenkins CI or GitOps frameworks
  • Advanced Python scripting skills for automation, data processing, and extending platform interoperability
  • Deep understanding of incident management frameworks and workflows with tools like PagerDuty, Uptrends, and other enterprise monitoring solutions
  • Proven expertise in troubleshooting and resolving complex platform challenges under tight SLAs
  • Strong capability in managing and scaling fault-tolerant platforms while ensuring performance, security, and compliance across large distributed systems
  • Demonstrated ability to mentor and grow team members, manage priorities, and act as a bridge between technical and non-technical teams
  • Excellent command of English (B2+ level), both written and spoken, with a strong emphasis on technical communication skills

Nice to have

  • Expertise in scripting with Groovy or experience in advanced Linux administration to optimize platform processes
  • Track record of optimizing observability workflows with additional integrations or customizations in tools like Uptrends, PagerDuty, or Elastic features
  • Hands-on experience with advanced Elastic Synthetics setups for robust monitoring and custom synthetic testing frameworks
  • Experience driving strategic initiatives such as modernization through AI tooling, cloud-native transitions, or cost-saving observability optimizations

  • Lead DBT Developer

    1 week ago


    Remote, Brazil EPAM Systems, Inc. Full-time R$90,000 - R$120,000 per year

    We are looking for a detail-focused and experienced Lead DBT Developer to design, implement, and optimize data pipelines and models using dbt and Snowflake. As a subject matter expert, you will play an essential role in creating scalable data solutions, ensuring compliance with standards, and transferring knowledge to clients. Responsibilities: Design high-level...


  • Remote, Brazil Data Meaning Full-time R$90,000 - R$120,000 per year

    Senior Full-Stack Developer (FastAPI + ReactJS/VueJS). Location: Brazil, remote. Position type: 2–3 months (Immediate Start). About Data Meaning: Data Meaning is a front-runner in Business Intelligence and Data Analytics consulting, renowned for our high-quality consulting services throughout the US and LATAM. Our expertise lies in delivering tailored solutions...

  • Lead AI Developer

    1 week ago


    Remote, Brazil Ci&T Full-time R$80,000 - R$120,000 per year

    We are tech transformation specialists, uniting human expertise with AI to create scalable tech solutions. With over 7,400 CI&Ters around the world, we've built partnerships with more than 1,000 clients during our 30 years of history. Artificial Intelligence is our reality. We are looking for AI-first engineers who use Generative AI as a foundation of software...


  • Remote, Brazil Ci&T Full-time R$80,000 - R$150,000 per year

    We are tech transformation specialists, uniting human expertise with AI to create scalable tech solutions. With over 7,400 CI&Ters around the world, we've built partnerships with more than 1,000 clients during our 30 years of history. Artificial Intelligence is our reality. Important: if you live in the Metropolitan Region of...


  • Remote, Brazil LUCKY365 CONSULTING LIMITED CORP. Full-time

    As a Customer Service Team Leader you will oversee both customer service and local operations workflows, ensuring seamless delivery of player support and operations. A key part of this role is to be highly resourceful in managing all online slots operations needs, including coordinating with vendors, handling game-related issues, and ensuring the timely...


  • Remote, Brazil Ci&T Full-time R$10,000 - R$60,000 per year

    We are tech transformation specialists, uniting human expertise with AI to create scalable tech solutions. With over 7,400 CI&Ters around the world, we've built partnerships with more than 1,000 clients during our 30 years of history. Artificial Intelligence is our reality. At CI&T, we are seeking a highly skilled and motivated Master Backend Developer...


  • Remote, Brazil Ci&T Full-time R$80,000 - R$160,000 per year

    We are tech transformation specialists, uniting human expertise with AI to create scalable tech solutions. With over 7,400 CI&Ters around the world, we've built partnerships with more than 1,000 clients during our 30 years of history. Artificial Intelligence is our reality. Required knowledge in: Java, Spring Boot, APIs, Microservices, Docker, and DevOps; NoSQL...


  • Remote, Brazil Ci&T Full-time R$80,000 - R$120,000 per year

    We are tech transformation specialists, uniting human expertise with AI to create scalable tech solutions. With over 7,400 CI&Ters around the world, we've built partnerships with more than 1,000 clients during our 30 years of history. Artificial Intelligence is our reality. At CI&T, we are seeking a highly skilled and motivated Mid/Senior Fullstack Developer...

  • Tech lead

    2 weeks ago


    Remote, Brazil Lean Tech Full-time R$150,000 - R$250,000 per year

    Company Overview: Lean Tech is a progressive organization, recognized for its influential network in software development and IT services. Our focus spans the entertainment, financial, and logistics sectors. Committed to professional empowerment and fostering a culture of innovation and inclusivity, our mission is to provide outstanding career advancement...


  • Remote, Brazil Ci&T Full-time US$60,000 - US$80,000 per year

    We are tech transformation specialists, uniting human expertise with AI to create scalable tech solutions. With over 7,400 CI&Ters around the world, we've built partnerships with more than 1,000 clients during our 30 years of history. Artificial Intelligence is our reality. At CI&T, we are seeking a highly skilled and motivated Mid-Level Fullstack Developer to...