Inference Systems Engineer

1 hora atrás


Ituiutaba, Brasil Noxx Tempo inteiro

Inference Systems Engineer Remote Infrastructure / Serving Systems $5,651 - $6,469/month USD Role Overview As an Inference Systems Engineer, you will own the serving runtime that powers production LLM inference.This is a deeply technical role focused on system performance and stability: optimizing request lifecycle behavior, streaming correctness, batching/scheduling strategy, cache and memory behavior, and runtime execution efficiency.You will ship changes that improve TTFT, p95/p99 latency, throughput, and cost efficiency while preserving correctness and reliability under multi-tenant load.You will collaborate closely with platform/infrastructure operations, networking, and API/control-plane teams to ensure the serving system behaves predictably in production and can be debugged quickly when incidents occur.This role is for engineers who can reason about the entire inference pipeline, validate improvements with rigorous measurement, and operate with production-grade discipline.Responsibilities Own the end-to-end serving runtime behavior: request lifecycle, streaming semantics, cancellation, retries interaction, timeouts, and consistent failure modes.Design and implement batching and scheduling strategy: dynamic batching, admission control, fairness under mixed tenants, priority lanes, and backpressure mechanisms to prevent cascading failures.Optimize performance at the systems level: reduce time-to-first-token, improve tail latency stability, increase tokens/sec throughput, and improve accelerator utilization under realistic workloads.Improve memory behavior and cache efficiency: KV-cache policies, fragmentation control, eviction strategies, and safeguards against OOM cliffs and performance thrash.Drive runtime execution optimizations: operator-level improvements, quantization integration, compilation/tuning paths where appropriate, and parameterization that produces stable performance across deployments.Establish a performance measurement discipline: reproducible benchmarks, realistic traffic traces, profiling workflows, regression detection gates, and dashboards tied to production outcomes.Build production readiness into the system: feature-flagged rollouts, canarying, safe configuration changes, and incident playbooks that reduce MTTR.Partner with networking and infrastructure operations to align deployment topology, failure domains, and capacity constraints to performance and reliability goals.Collaborate with product and API teams to ensure the serving layer's guarantees are reflected accurately in external interfaces and customer expectations.Requirements 5+ years building high-performance systems (model serving, GPU systems, performance engineering, or low-latency distributed systems).Strong understanding of LLM inference tradeoffs: batching vs latency, prefill vs decode dynamics, cache behavior, memory pressure, and tail latency causes.Comfort working across Python/C++ stacks with production profiling and debugging tools.Track record of shipping performance improvements that hold up under production variance and operational constraints.Strong engineering hygiene: tests, instrumentation, documentation, and careful rollout discipline.Ability to communicate clearly across teams and operate calmly during incidents.



  • Ituiutaba, Brasil Bebeeperformance Tempo inteiro

    Job DescriptionWe are seeking a highly skilled Inference Systems Engineer to join our team. This is a deeply technical role focused on system performance and stability.Owning the serving runtime that powers production LLM inference, you will optimize request lifecycle behavior, streaming correctness, batching/scheduling strategy, cache and memory behavior,...

  • Linux System Engineer

    Há 8 horas


    Ituiutaba, Brasil Incomm Payments Tempo inteiro

    We are seeking a highly skilled and experienced Senior Linux System Engineer to join our In Comm Operations team. Ideally, you will have a strong background in Red Hat and Oracle Linux system administration, automation with Ansible, as well as deep expertise in Linux patching, scripting, and GIT version control.100% Remote + CLT + Benefits (Health Insurance...

  • Ai Engineer

    1 hora atrás


    Ituiutaba, Brasil Cor Tempo inteiro

    AI Engineer (Early-Stage AI Startup)RemoteUS / Los Angeles Time (preferred)Startup environment — high ownership, high impactCOR ) is an early-stage AI startup building intelligent agents for agencies and modern teams. We're not experimenting — we're building real AI systems used in production.We're looking for an AI Engineer who loves building, shipping,...

  • Linux System Engineer

    Há 8 horas


    Ituiutaba, Brasil Incomm Payments Tempo inteiro

    We are seeking a highly skilled and experiencedSenior Linux System Engineerto join our InComm Operations team. Ideally, you will have a strong background in Red Hat and Oracle Linux system administration, automation with Ansible, as well as deep expertise in Linux patching, scripting, and GIT version control.100% Remote + CLT + Benefits (Health Insurance...

  • Lead Software Engineer

    2 minutos atrás


    Ituiutaba, Brasil Vaga Para Lead Software Engineer Tempo inteiro

    Lead Software Engineer Remote – must be based in Brazil Platform Science is an open IoT platform that partners with fleets, developers, vehicle manufacturers, and equipment providers to deliver revolutionary solutions to supply chain professionals worldwide. We are seeking a Lead/Senior Software Engineer to join our Video team. In this high-impact role you...

  • Data Engineer

    46 minutos atrás


    Ituiutaba, Brasil Georgiatek Systems Inc. Tempo inteiro

    We are hiring!Greetings!My name is Kathiresan, and I'm with Georgia Tek Please find the job description below.If you're interested, I would appreciate it if you could share your updated resume with me.Role: Senior Data Engineer Location: Remote Type: Contract Job Description: GraphQL API design (federated architecture, subgraphs, schema optimization...


  • Ituiutaba, Brasil Bebeesoftwaredeveloper Tempo inteiro

    Job Title: Senior Embedded Software DeveloperJob Description:We are seeking a skilled senior software engineer to join our team and play a key role in designing and implementing embedded software for high-precision systems.You will work closely with electrical, mechatronic, and system engineers to define software requirements, implement new modules in C/C++...


  • Ituiutaba, Brasil Bebeecybersecurity Tempo inteiro

    Role OverviewA Systems Engineer is required to design, implement and support complex IT systems for enterprise clients. The ideal candidate will work hands-on with modern infrastructure technologies, collaborate with internal engineering teams and provide expert guidance to help organizations optimize their IT environments.Key ResponsibilitiesDesign and...

  • Database Engineer

    46 minutos atrás


    Ituiutaba, Brasil Georgiatek Systems Inc. Tempo inteiro

    Job Title: L3 MySQL PostgreSQL Database Engineer Location: Brazil (Remote) About the Role We are looking for a senior-level database engineer to lead the design, deployment, and automation of PostgreSQL and MySQL platforms across both cloud infrastructure and Kubernetes environments.This role requires strong experience building resilient systems using...

  • Back End Developer

    Há 8 horas


    Ituiutaba, Brasil Georgiatek Systems Inc. Tempo inteiro

    Backend Engineer Location: Brazil (Remote)Role OverviewWe are looking for a Senior Backend Engineer to join our Content Systems – Convergence team.In this role, you will design, build, and scale backend services that power content platforms used at scale.You will work closely with product, platform, and DevOps teams to deliver reliable, secure, and...