Site Reliability Engineer
1 week ago
Summary
We at Softensity are looking for a Site Reliability Engineer (SRE). This is a dynamic, hands-on role within a global, collaborative SRE environment. The SRE Technical Member will contribute to building resilient systems, automating operations, and ensuring the platform meets high standards for performance, reliability, and security. You will work closely with Central SRE, DevOps, InfoSec, and Agile development teams to maintain platform stability, scalability, and performance.

Project Context
The Platform is a distributed, cloud-based system serving hundreds of geographically dispersed clients. It operates on Microsoft Azure using a microservices architecture, combining open-source, licensed, and internally developed tools for provisioning, deployment, monitoring, and logging. SREs own the entire production stack, from application functionality to infrastructure resilience, ensuring availability, reliability, and scalability in a 24/7 operational environment. This role requires problem-solving through data, collaboration, and technical expertise, maintaining a balance between engineering innovation and practical delivery.

Key Responsibilities
Collaborate with Central SRE, DevOps, and InfoSec teams on new projects, platform builds, and deployments.
Contribute to the design, implementation, and operation of large-scale, Azure-based platforms.
Apply industry best practices in monitoring, alerting, reporting, and cloud architecture.
Participate in infrastructure, application, and security planning, focusing on scalability, redundancy, and data preservation.
Support high-availability topologies with development teams.
Produce documentation and weekly operational status reports, detailing project progress and key metrics.
Provide engineering and support for technical infrastructure, cloud, databases, and application performance.
Manage incident response, change management, and user permissions following SRE best practices (Google SRE model).
Maintain close collaboration between Application, Central SRE, DevOps, InfoSec, and business units.
Assist in configuring and onboarding new applications into the Azure DevOps (ADO) platform.

Core Technical Skills

Operational Skills
Strong understanding of SRE fundamentals: monitoring, alerting, reporting, performance, availability, and incident response.
Hands-on experience with CI/CD tools (Git, Azure Pipelines, Ansible, etc.).
Infrastructure as Code (IaC) design, scripting, and setup.
Deep knowledge of Azure Web Services: installation, configuration, and management.
Experience administering Microsoft applications (.NET, C#, Angular) with a focus on automation, optimization, and security.
Proficiency in Cosmos DB and MS SQL operational tasks.
Excellent troubleshooting, root-cause analysis, and problem-solving skills.
Experience with disaster recovery, scalability testing, and capacity planning.

Automation Skills
Expertise with cloud deployment and automation tools (Git, Azure DevOps, Ansible, etc.).
Ability to automate routine deployment, monitoring, and administrative tasks.
Ability to write and maintain documentation and custom tools for monitoring and performance optimization.

Scripting & Development
Proficiency in Shell scripting and API troubleshooting for production support.
Experience designing, authoring, and maintaining .NET / C# code.
Capability to deliver hotfixes and operational patches (.NET & Angular).
Working knowledge of automation scripting languages for operational tools development.

Qualifications
Bachelor’s degree in a technical discipline (Computer Science, Engineering, or related field).
5+ years of industry experience in SRE, DevOps, or related technical operations roles.
Proven experience in cloud infrastructure, automation, and application reliability engineering within large-scale, enterprise environments.

Why Join Us?
We are passionate about top-quality talent and about giving our team members the tools they need to keep growing and learning. The sky is truly the limit, and we want you to feel challenged and motivated in every project you're a part of, all while working with cutting-edge technologies and amazing clients.

What to expect?
Coursera credentials
Remote work
Learning opportunities

Softensity is an equal-opportunity employer. All qualified applicants are considered without regard to gender, identity, or personal background.
-
Site Reliability Engineer Sr
1 week ago
Brazil, BR Mercado Eletrônico Full-time
Mercado Eletrônico is the Latin American leader in B2B procurement management solutions. Its technologies and services for procurement departments help companies achieve greater savings, agility, governance, and collaboration. With offices in Brazil, the United States, Mexico, and Portugal, it counts more than 1 million suppliers, 10 thousand...
-
Site Reliability Engineer
2 weeks ago
Brazil, BR Psm Company Full-time
About the job
PSM Company specializes in identifying talent for IT and telecom as well as for operational and administrative areas. Our success story is built on a business model that delivers accuracy and quality in the selection process, low turnover, and freedom from risks and liabilities...
-
DevOps Engineer
1 week ago
Brazil, BR Flowmentum, Inc. Full-time
We’re Flowmentum, and our clients are fast-moving teams building reliable, scalable, and secure infrastructure for companies shaping the future of AI, fintech, cloud services, and beyond. Our engineers work on high-traffic, mission-critical systems that power millions of users across the globe. We believe in autonomy, ownership, and solving hard problems...
-
Platform Engineer
2 days ago
Brazil, BR Stefanini Brasil Full-time
Job Description
We are looking for a senior Platform Engineer / DevOps Engineer to build and evolve platform infrastructure services, adopting modern practices in automation, observability, and infrastructure as a product, and supporting development teams with scalable, secure, and resilient environments. Work for a client...
-
Senior Site Reliability Engineer
5 days ago
Brazil, BR TQI Full-time
We are looking for a Senior SRE to work in critical, highly scalable environments, ensuring reliability, performance, and security in AWS-based solutions. Beyond the technical responsibilities, you will also be a strategic partner to the pre-sales team, helping to understand customer needs and proposing the best...
-
Data Engineer
2 weeks ago
Brazil, BR Zunzun Solutions Full-time
Summary:
We are seeking a highly skilled Data Engineer (Azure Databricks) to design, implement, and optimize enterprise-grade data pipelines. In this role, you will leverage Azure Databricks, Azure Data Factory, SQL Server, and Python to enable scalable, governed, and performant data solutions. You will play a key role in modernizing our data platform on the...
-
Databricks Data Engineer
2 weeks ago
Brazil, BR GlobalSource IT Full-time
Databricks Data Engineer
Fully Remote Contract
We’re looking for a hands-on Databricks Data Engineer with strong experience building scalable data pipelines using Spark, PySpark, SQL, and Delta Lake. This role focuses on ingesting data from multiple sources, transforming it for analytics, and publishing high-quality datasets and...
-
Control Engineer
1 week ago
Brazil, BR TARGAN Inc. Full-time
Controls Engineer – Sustaining
Remote | Global Support Role | Full-Time
TARGAN
About the Role
TARGAN is transforming animal agriculture through advanced automation, and we’re looking for a Controls Engineer – Sustaining to help keep our systems running smoothly around the world. This is a hands-on, global support role that bridges Field Services and R&D...
-
Network Implementation Engineer
4 weeks ago
Brazil, BR HCLTech Full-time
Job Title: L3 Network Operations and Implementation Engineer – LAN/WLAN
Job Summary:
We are seeking a skilled and motivated L3 Network Engineer to provide Level 3 network support and to implement LAN and WLAN infrastructure at sites across North and South America. The ideal candidate will hold at least a CCNA certification, possess strong experience in...
-
DevOps Engineer
1 week ago
Brazil, BR Flowmentum, Inc. Full-time
DevOps & Platform Engineers
We’re hiring DevOps/Platform Engineers with strong SRE skills to work on high-scale SaaS platforms. Our stack is heavy on EKS and MongoDB/Atlas, and you’ll be tackling database contention, scaling challenges, and complex deployments every day. This role is for problem solvers who thrive on multitasking, navigating ambiguity, and...