
Stability Expert
2 semanas atrás
As a Site Reliability Engineer, you will play a pivotal role in ensuring the stability and scalability of our systems. This is an opportunity to showcase your technical expertise and contribute to the success of our organization.
- Handle major incidents via a structured approach and provide timely updates until resolution.
- Perform thorough application troubleshooting and identify preventive measures.
- Manage requests related to deployments, feature toggles, and data fixes.
- Coordinate with cross-functional teams to resolve production incidents.
- Enhance monitoring capabilities using tools like Dynatrace, Kibana, and Splunk.
- Develop and improve monitoring scripts and alerts based on incident learnings.
- Respond to customer escalations and coordinate with Support & Engineering teams.
- Support planned activities and respond to ad-hoc requests from engineering teams.
- Deep experience in DevOps and Production Support.
- Experience in automation and CI/CD practices.
- Familiarity with cloud platforms (GCP, AWS, or Azure).
- Hands-on experience with monitoring tools such as Dynatrace, Kibana, and Splunk.
- Strong troubleshooting skills and ability to deep dive into application issues.
- Excellent communication and coordination skills across teams.
-
Servicenow It Expert
1 dia atrás
Recife, Brasil Bebeeservicenow Tempo inteiroJob Title: Servicenow Developer LeadRole Overview:We are seeking a highly skilled Servicenow Developer with extensive ITOM experience to lead the development and enhancement of our platform.The ideal candidate will have deep expertise in CMDB, Discovery, Integrations, and related ITOM modules.Key Responsibilities:Design, develop, and implement complex...
-
Chief Cloud Architect
2 semanas atrás
Recife, Brasil beBeeCloud Tempo inteiroCloud Engineering Expert We're seeking a seasoned Cloud Engineer to join our team. As a Cloud Engineer, you will design and deploy cloud-native systems on AWS that meet high standards for performance and scalability. Your key responsibilities include owning Kubernetes clusters: architecture, automation, observability, and performance tuning to ensure...