Reliability Engineer

Há 2 dias


Brasil Flinks Tempo inteiro

Flinks is where financial data moves—with purpose, trust, and impact.

We're on a mission to simplify access to financial data and help businesses build better, faster, and more secure financial products and experiences. Since 2016, we've been bridging the gap between fintechs, financial institutions, and consumers by enabling seamless, secure data connectivity.

From instant account funding to smarter lending, our solutions help power some of the most innovative financial products in North America. We partner with lenders, banks, and fintechs to streamline onboarding, prevent fraud, and fuel real-time decision-making with enriched, reliable data.

As pioneers in Canada's open banking movement, we're not waiting for the future—we're building it. If you're bold, curious, and ready to help shape the future of finance, we'd love to meet you.

About the Reliability Team

As a Reliability Engineer, you will play a pivotal role in ensuring the stability, performance, and reliability of Flinks Fintech product platforms, and monitoring & alerting systems. You will serve as an expert in both software development and system support, working closely with engineering, operations, and product teams to troubleshoot complex issues, resolve incidents, and continuously improve the technical foundation of our products. This role demands a combination of advanced coding skills, incident management experience, and an understanding of the fin-tech industry.

What You'll Do
  • Develop and maintain code to quickly resolve product issues, ensuring fast recovery and long-term system stability
  • Provide live operational support across multiple client applications, monitoring services and alerts to detect and resolve critical failures with minimal downtime
  • Own and troubleshoot complex incidents, conduct root cause analyses, and implement long-term solutions—adhering to SLAs and internal SLOs
  • Build monitoring dashboards and alerting systems to proactively detect and address issues, supporting system scalability and stability
  • Analyze operational metrics and KPIs to identify trends, surface client pain points, and drive improvements
  • Automate tooling and processes to improve efficiency and reduce manual work across LiveOps
  • Collaborate with cross-functional teams to deliver lasting fixes for production issues and contribute to technical analyses of product gaps
  • Lead and mentor reliability engineers, providing guidance and ensuring consistent delivery of high-quality work
  • Participate in post-incident reviews, documenting outcomes and driving preventative action items
  • Support after-hours on-call coverage as part of the LiveOps rotation
Qualifications
  • 5+ years of experience with .NET Framework (C#), ensuring production system stability
  • Strong coding, debugging, and troubleshooting skills, particularly in performance optimization of large-scale applications
  • Operationally focused with expertise in incident management and resolving live production issues
  • Proven experience in building and maintaining reliable monitoring and alerting systems in high-demand environments, with a focus on production support
  • Strong knowledge of Kubernetes, Docker, and cloud platforms (GCP preferred)
  • Proficiency with monitoring tools like Prometheus, Grafana, and Kibana
  • Experience with incident ticketing/documentation tools like FreshDesk and Confluence
  • Critical thinker who can identify system weaknesses and find innovative solutions
  • Strong project management skills with a focus on scalability and system stability
Nice to haves
  • ITIL Service Management certification (or equivalent) is highly desired, such as ITIL v3, ITIL v4, or other equivalent certifications
  • Experience with PowerBI, web scraping, or Golang
The Interview Process
  • Head of People Ops
  • Case Assignment & Presentation
  • Director Interview
Seniority level
  • Mid-Senior level
Employment type
  • Full-time
Job function
  • Engineering and Information Technology
  • Industries
  • Technology, Information and Internet

Referrals increase your chances of interviewing at Flinks by 2x

#J-18808-Ljbffr

  • Brasil Rosebel Gold Mines N.V. Tempo inteiro

    OverviewAt Rosebel Gold Mines N.V., we are seeking a highly skilled and motivated MillReliability Engineer to join our Mill department. As a Reliability Engineer, you will play a crucial role in ensuring the smooth operation and maintenance of our fixed asset milling equipment. In this role, you will play a crucial role in enhancing the team's understanding...

  • Site Reliability Engineer

    2 semanas atrás


    Brasil Seedify Tempo inteiro US$90.000 - US$120.000 por ano

    Seedify is a leading cryptocurrency launchpad platform dedicated to fostering innovation and success in the Web3 space. Our mission is to identify and assist promising teams and projects and offer outstanding returns to our investor base.Job DescriptionWe are seeking a highly skilled Site Reliability Engineer with extensive experience in DevOps,...


  • Brasil beBeeReliability Tempo inteiro US$90.000 - US$120.000

    Reliability Engineering LeadThe ideal candidate will possess a strong understanding of reliability principles and methodologies, with expertise in design for reliability (DfR) activities to incorporate reliability into early design phases. Experience with material selection, derating, redundancy and tolerance stack up analysis is essential.A bachelor's...


  • Brasil beBeeReliability Tempo inteiro US$140.000 - US$170.000

    Unlock the Art of Reliability EngineeringJob DescriptionWe are seeking a seasoned reliability engineer to join our team. In this role, you will be responsible for ensuring the stability and performance of our services.Your primary focus will be on designing, building, and maintaining scalable and efficient systems that meet the needs of our users.You will...

  • Site Reliability Engineer

    2 semanas atrás


    Brasil Aubay Portugal Tempo inteiro

    Aubay Portugal is a multinational French company, in Portugal since 2007. We have offices in Lisbon and Oporto and we are a specialized consultant in Management, Implementation, Development and Maintenance of Information Systems. We have more than 150 active partners and we operate in sectors such as banking, insurance, telecommunications, services, energy...

  • Reliability Engineer

    2 semanas atrás


    Brasil beBeeReliability Tempo inteiro R$80.000 - R$120.000

    Job Title: Reliability Engineering SpecialistWe are seeking a skilled Reliability Engineering Specialist to join our team. As a key member of our infrastructure and operations group, you will be responsible for creating scalable and highly reliable software systems.The successful candidate will have expertise in monitoring tools, automated tasks, and release...


  • Brasil beBeeReliability Tempo inteiro R$54.150 - R$81.150

    System Reliability Engineer RoleWe are seeking a highly motivated and skilled System Reliability Engineer to join our dynamic team. The ideal candidate will have a foundational understanding of monitoring and observability tools, such as Grafana and Dynatrace, and a keen interest in ensuring system reliability and performance.Responsibilities:Implement and...


  • Brasil Housecall Pro Tempo inteiro

    Join to apply for the Senior DevOps Site Reliability Engineer role at Housecall Pro Join to apply for the Senior DevOps Site Reliability Engineer role at Housecall Pro Get AI-powered advice on this job and more exclusive features. TO BE CONSIDERED FOR THIS ROLE, PLEASE SUBMIT AN UPDATED RESUME TRANSLATED TO ENGLISH Who is Housecall Pro? Housecall Pro is...


  • Brasil Kraken Tempo inteiro

    OverviewSite Reliability Engineer - Data Platform role at Kraken. Join our Data Infrastructure team to uphold the reliability, scalability, and efficiency of our data platform. ResponsibilitiesDesign the data governance mechanisms that ensure our lakehouse is easy to interact with, secure and in compliance with all applicable regulations. Implement the...


  • Brasil Articul8 AI Tempo inteiro

    Senior Site Reliability Engineer (SRE) - (Brazil)Senior Site Reliability Engineer (SRE) - (Brazil)2 weeks ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. About Us Articul8 AI is at the forefront of Generative AI innovation, delivering cutting-edge SaaS products that transform how businesses operate. Our...