Site Reliability Engineer

Há 2 dias


Brasil Kraken Tempo inteiro
Overview

Site Reliability Engineer - Data Platform role at Kraken. Join our Data Infrastructure team to uphold the reliability, scalability, and efficiency of our data platform.

Responsibilities
  • Design the data governance mechanisms that ensure our lakehouse is easy to interact with, secure and in compliance with all applicable regulations.
  • Implement the infrastructure we use to ingest our data, store it, catalog it with the right metadata and capture its lineage.
  • Provide a state-of-the-art suite of BI tools for multiple teams within the company.
  • Guarantee the availability, high performance, scalability and cost efficiency of our data platform.
  • Implement data infrastructure solutions (self service) that support the needs of 10+ business units and over 100 engineering and data analysts
  • Utilize Infrastructure as Code (IaC) principles to design, provision, and manage both on-premises and cloud (AWS) infrastructure components using tools such as Terraform
  • Develop and maintain automation scripts using bash/shell scripting to automate operational tasks and deployments
  • Enhance and manage CI/CD pipelines to facilitate consistent software deployments across the data infrastructure
  • Implement robust data monitoring and alerting solutions to proactively detect anomalies and performance issues
  • Manage and implement role-based access control (RBAC) and permissions for multiple user groups and machine workflows across environments
  • Manage and maintain real-time streaming data architecture using technologies like Kafka and Debezium (CDC)
  • Ensure timely and accurate processing of streaming data for insights
  • Utilize Kubernetes to manage containerized applications within the data infrastructure
  • Implement incident response procedures and participate in on-call rotations
  • Collaborate with data analysts, engineers, and cross-functional teams to understand requirements and implement solutions
  • Document architecture, processes, and best practices to enable knowledge sharing and continuous improvement
  • Support AI/ML teams with their infra requests
Qualifications
  • Proven experience (5+ years) as a Site Reliability Engineer, Infrastructure Engineer, Data Infrastructure Engineer, or similar roles with a focus on data infrastructure and security
  • Experience with real-time data processing technologies such as Kafka, Flink, and Debezium
  • Experience managing hybrid multi-tenant cloud systems, particularly on AWS
  • Infrastructure as Code tools such as Terraform, Terragrunt and Atlantis
  • Experience with containerization/orchestration tools (Kubernetes, Nomad, Docker)
  • Strong Bash/shell scripting and proficiency in at least one programming language (preferably Python or JVM languages)
  • Experience with data technologies: Apache Airflow, Apache Spark, databases, BI tooling
  • Experience solving data access management at large-scale data lakes
  • Familiarity with CI/CD deployment pipelines and related tools
  • Strong problem-solving skills and ability to troubleshoot complex systems

This job is accepting ongoing applications and there is no application deadline.
Please note, applicants may redact or remove information identifying age, date of birth, or dates of attendance/graduation on their resume.
We consider qualified applicants with criminal histories for employment consistent with the San Francisco Fair Chance Ordinance.

Kraken is powered by people from around the world and we celebrate diverse talents, backgrounds, and perspectives. We hire based on merit and encourage applying for roles even if you don\'t meet every listed requirement, especially if you\'re passionate about crypto. Kraken is an equal opportunity employer; we do not tolerate discrimination or harassment. See Kraken\'s Career and Privacy policies for more information.

#J-18808-Ljbffr
  • Site Reliability Engineer

    2 semanas atrás


    Brasil Aubay Portugal Tempo inteiro

    Aubay Portugal is a multinational French company, in Portugal since 2007. We have offices in Lisbon and Oporto and we are a specialized consultant in Management, Implementation, Development and Maintenance of Information Systems. We have more than 150 active partners and we operate in sectors such as banking, insurance, telecommunications, services, energy...

  • Site Reliability Engineer

    1 semana atrás


    Brasil Seedify Tempo inteiro US$90.000 - US$120.000 por ano

    Seedify is a leading cryptocurrency launchpad platform dedicated to fostering innovation and success in the Web3 space. Our mission is to identify and assist promising teams and projects and offer outstanding returns to our investor base.Job DescriptionWe are seeking a highly skilled Site Reliability Engineer with extensive experience in DevOps,...


  • Brasil Housecall Pro Tempo inteiro

    Join to apply for the Senior DevOps Site Reliability Engineer role at Housecall Pro Join to apply for the Senior DevOps Site Reliability Engineer role at Housecall Pro Get AI-powered advice on this job and more exclusive features. TO BE CONSIDERED FOR THIS ROLE, PLEASE SUBMIT AN UPDATED RESUME TRANSLATED TO ENGLISH Who is Housecall Pro? Housecall Pro is...


  • Brasil Articul8 AI Tempo inteiro

    Senior Site Reliability Engineer (SRE) - (Brazil)Senior Site Reliability Engineer (SRE) - (Brazil)2 weeks ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. About Us Articul8 AI is at the forefront of Generative AI innovation, delivering cutting-edge SaaS products that transform how businesses operate. Our...


  • Brasil beBeeEngineering Tempo inteiro €60.000 - €90.000

    Aubay Portugal is a multinational French company operating in sectors such as banking, insurance, telecommunications, services, energy and transports. About the JobWe are looking for a skilled Site Reliability Engineer with experience in Azure Cloud and Kubernetes to join our team.4 years of experience as a Site Reliability Engineer;Experience with Azure...

  • site reliability engineer

    3 semanas atrás


    Brasil Bernoulli Educação Tempo inteiro

    Join to apply for the SITE RELIABILITY ENGINEER role at Bernoulli Educação Join to apply for the SITE RELIABILITY ENGINEER role at Bernoulli Educação Se o olho brilha, vem ser Bernoulli Somos feitos de pessoas que acreditam no poder transformador da educação. Gente criativa, determinada e que gosta de aprender. Profissionais que enxergam os...

  • Site Reliability Engineer

    2 semanas atrás


    Brasil Pythian Tempo inteiro

    Site Reliability Engineer Multiple timezones available |Remote | Work from Home Why Pythian: At Pythian, we are experts in strategic database and analytics services, driving digital transformation and operational excellence. Pythian, a multinational company, was founded in 1997 and started by ensuring the reliability and performance of mission-critical...


  • Brasil DuckDuckGo Tempo inteiro

    6 days ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Who We Are Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable since 2014, our annual revenue now exceeds $100 million USD. Millions use our...

  • Site Reliability Engineer

    3 semanas atrás


    Brasil Parfin Tempo inteiro

    About Parfin Parfin is the leading web3 infrastructure provider in Latin America. We offer institutions an end-to-end solution for digital asset custody, trading, tokenization, and management. Our clients include some of the largest banks and crypto-native companies in Latin America. We accelerate institutional adoption of web3 by creating solutions that...

  • Site Reliability Engineer

    2 semanas atrás


    Brasil Aubay Portugal Tempo inteiro

    Site Reliability Engineer - Relocation to Portugal Aubay Portugal is a multinational French company, with offices in Lisbon and Oporto. We are a specialized consultant in Management, Implementation, Development and Maintenance of Information Systems, and we operate in sectors such as banking, insurance, telecommunications, services, energy and transports....