Site Reliability Engineer

1 dia atrás


Fortaleza, Brasil Agileengine Tempo inteiro

OverviewSite Reliability Engineer (Middle/Senior) ID***** — AgileEngineAgileEngine is an Inc. **** company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards.Join to apply for the Site Reliability Engineer (Middle/Senior) ID***** role at AgileEngine.What you will doShift: Monday – Thursday 8AM – 7PM PST (11AM – 10PM EST) with rotating on-call;On call shifts: every 6 weeks, for one week as primary responder and next week as secondary;Manage alerts daily, check systems, and escalate issues as needed;Be part of a team that provides 24×7 on-call support for critical SaaS events;Be available in case of emergencies when team members are not available or need help;Document issues and remediation steps;Proactively create appropriate monitors in the EKS/K8S ecosystem;Deploy to EKS/K8s cluster using Terraform and Helm;Learn and maintain existing infrastructure running under Docker Swarm;Improve existing infrastructure health by implementing checks and scripts to correct known issues;Maintain and develop deployment code;Automate manual tasks;Implement/integrate new technologies in our Cloud Infrastructure;Collaborate with other teams and departments to provide the highest level of support and assistance;Apply a real customer focus when planning deployments/updates, having the customer in the forefront of the mind, and considering the impact on them before making changes;Work closely on solutions with Support, Customer Success, Migration, and Professional Services teams to provide the best in class SaaS service to our customers;Perform RCA and take necessary corrective actions to prevent the recurrence of issues;Create and assign alert-related actions to the appropriate team after the investigation;Handle support requests for environment-specific actions;Identify and provide automation requirements to improve RCA.MUST HAVES2+ years of professional experience;Experience working with Datadog;Hands-on experience as an AWS Cloud Engineer;Working knowledge of EKS/Terraform/Helm;Working experience with Docker and Docker Swarm;Good understanding of AWS IAM roles and policies;Experience logging and monitoring AWS resources using CloudWatch logs;Experience working in a Linux environment;Proficient in Bash and/or Python scripting;A strong understanding of web technologies such as REST APIs;Working experience with monitoring solutions, such as Grafana and Prometheus;Excellent oral and written communication skills;Customer-facing communication skills to effectively explain issues and RCAs to them;Experience in Product/Application Support for SaaS-based products;Understanding of APIs, Databases, Systems Architecture, and Design;Designing, implementing, and operating in a DevSecOps;Excellent communication skills, both written and verbal;Ability to work independently as well as within a collaborative environment;A technical aptitude with the desire to learn new and evolving technologies;Upper-Intermediate English level.NICE TO HAVESExperience with GCP or Azure;Certifications: AWS Certified DevOps Engineer – Professional or AWS Certified Advanced Networking Specialty.PERKS AND BENEFITSProfessional growth: Accelerate your professional journey with mentorship, TechTalks, and personalized growth roadmaps.Competitive compensation: We match your ever-growing skills, talent, and contributions with competitive USD-based compensation and budgets for education, fitness, and team activities.A selection of exciting projects: Join projects with modern solutions development and top-tier clients that include Fortune 500 enterprises and leading product brands.Flextime: Tailor your schedule for an optimal work-life balance, by having the options of working from home and going to the office – whatever makes you the happiest and most productive.Seniority levelMid-Senior levelEmployment typeFull-timeJob functionIndustriesIT Services and IT ConsultingReferrals increase your chances of interviewing at AgileEngine by 2xGet notified about new Senior Site Reliability Engineer jobs in Fortaleza, Ceará, Brazil.Site Reliability Engineer - Remote Work | REF#******We're unlocking community knowledge in a new way.Experts add insights directly into each article, started with the help of AI.#J-*****-Ljbffr


  • Site Reliability Engineer

    3 semanas atrás


    Fortaleza, Brasil INDI Staffing Services Tempo inteiro

    3 days ago Be among the first 25 applicants Direct message the job poster from INDI Staffing Services At INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work. Overview...


  • Fortaleza, Brasil BairesDev Tempo inteiro

    Site Reliability Engineer - Remote Work: At BairesDev, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely on roles that drive significant impact...


  • Fortaleza, Brasil Mercado Eletrônico Tempo inteiro

    O Mercado Eletrônico é líder na América Latina em soluções de gestão de compras B2B. Suas tecnologias e serviços para as áreas de compras ajudam empresas a conquistarem mais economia, agilidade, governança e colaboração. Com escritórios no Brasil, Estados Unidos, México e Portugal, contabiliza mais de 1 milhão de fornecedores, 10 mil...


  • Fortaleza, Brasil AgileEngine Tempo inteiro

    Overview Site Reliability Engineer (Middle/Senior) ID38916 — AgileEngine AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to...


  • Fortaleza, Brasil AgileEngine Tempo inteiro

    OverviewSite Reliability Engineer (Middle/Senior) ID38916 — AgileEngine AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to...


  • Fortaleza, Brasil Bebeesoftware Tempo inteiro

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer and Software Developer to join our team.This is a unique opportunity to work with a cutting-edge technology consulting company, driving innovation and excellence in software development and reliability engineering.The ideal candidate will have 4-6 years of experience in DevOps and...

  • Site Reliability Engineer

    4 semanas atrás


    Fortaleza, Brasil AgileEngine Tempo inteiro

    Site Reliability Engineer (Middle) ID38916Join to apply for the Site Reliability Engineer (Middle) ID38916 role at AgileEngine . OverviewAgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and...


  • Fortaleza, Brasil Agileengine Tempo inteiro

    Site Reliability Engineer (Middle) ID*****Join to apply for the Site Reliability Engineer (Middle) ID***** role at AgileEngine.OverviewAgileEngine is an Inc. **** company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI/ML, and our...

  • Site Reliability Engineer

    3 semanas atrás


    Fortaleza, Brasil AgileEngine Tempo inteiro

    Site Reliability Engineer (Middle) ID38916 Join to apply for the Site Reliability Engineer (Middle) ID38916 role at AgileEngine . Overview AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML,...


  • Fortaleza, Brasil FullStack Labs Tempo inteiro

    Join to apply for the AWS/Kubernetes DevOps Engineer - Remote - Latin America role at FullStack Labs 2 days ago Be among the first 25 applicants Join to apply for the AWS/Kubernetes DevOps Engineer - Remote - Latin America role at FullStack Labs About FullStackFullStack is the most transparent IT talent network, connecting highly skilled individuals with top...