
Observability Engineer
3 weeks ago
An extraordinarily talented group of individuals works together every day to drive TNS' success, from both professional and personal perspectives. Come join the excellence!
Overview
TNS is looking for an Observability Engineer to support the design, implementation, and evolution of our observability stack. This role is critical in ensuring the reliability, performance, and scalability of our systems by providing deep visibility into infrastructure and application behavior. You will collaborate with cross-functional teams to define observability standards and drive adoption of best practices across the organization.
Responsibilities
- Lead the design, implementation, and continuous improvement of the observability stack, including monitoring, logging, and tracing systems.
- Define and enforce observability standards and best practices across engineering teams to ensure consistent instrumentation and visibility.
- Build scalable monitoring solutions that provide real-time insights into system health, performance, and availability.
- Develop and maintain dashboards, alerts, and automated responses to proactively detect and resolve issues before they impact users.
- Collaborate with development, infrastructure, and SRE teams to integrate observability into CI/CD pipelines and production workflows.
- Conduct root cause analysis and post-incident reviews to identify observability gaps and drive improvements.
- Evaluate and implement tools such as Splunk, Splunk Observability Cloud, and Netreo to support monitoring and alerting needs.
- Champion a culture of data-driven decision-making by enabling teams to access and interpret observability data effectively.
- Automate observability pipelines and alerting mechanisms.
Qualifications
- 5+ years of experience in Site Reliability Engineering, DevOps, or Observability roles.
- 3+ years of experience in SRE/DevOps.
- Demonstrated success in deploying and managing monitoring tools and observability solutions at scale.
- Hands-on experience with monitoring and observability platforms such as Splunk, Splunk Observability Cloud (O11y), Grafana, Prometheus, and Datadog.
- Proven ability to design and implement SLOs/SLIs, dashboards, and alerting strategies that align with business and operational goals.
- Familiarity with incident response, alert tuning, and postmortem analysis.
- Strong scripting or programming skills (e.g., Python, Go, Bash).
- Excellent communication and collaboration skills, with a focus on knowledge sharing and mentorship.
Desired
- Strong understanding of distributed tracing tools like OpenTelemetry, Jaeger, or Zipkin.
- Experience integrating observability into CI/CD pipelines and Kubernetes environments.
- Contributions to open-source observability tools or frameworks.
- Strong knowledge of cloud platforms (AWS, Azure, or GCP) and container orchestration (Kubernetes).
If you are passionate about technology and love personal growth and opportunity, come see what TNS is all about.
TNS is an equal opportunity employer. TNS evaluates qualified applicants without regard to race, color, religion, gender, national origin, age, sexual orientation, gender identity or expression, protected veteran status, disability/handicap status or any other legally protected characteristic.