All roles

Cloud Engineer (Containers & Kubernetes)

Remote · USA Full-time New today

Responsible for identifying and resolving end-to-end performance bottlenecks across distributed systems, Spring Boot services, middleware components, and hybrid cloud environments (private cloud + AWS). This role goes far beyond traditional testing by deeply analyzing container orchestration, networking paths, and system interactions under load. This position maps full system workflows, sets realistic latency budgets, and ensures each component meets its SLOs. Ideal candidates have extensive experience with high-scale, multi-region, and high-transaction platforms (e.g., financial systems, payment processing, or large enterprise SaaS) running in a Cloud environment.

Key Responsibilities

  • Define service-level objectives (SLOs), performance budgets, and latency/throughput targets across services.
  • Architect and champion comprehensive distributed tracing strategies (Dynatrace, AWS X-Ray, etc.).
  • Analyze application, platform, and cloud behavior using deep-dive techniques such as heap dumps, thread dumps, flame graphs, logs, network traces, and storage I/O profiling.
  • Review service and system architectures for performance risks (e.g., synchronous hops, excessive dependencies, misconfigured connection pools, poor cache placement).
  • Conduct and lead root-cause analysis for performance incidents in production and pre-production environments.
  • Develop capacity models and performance baselines for services running across cloud environments.

Areas of Expertise

  • Application Layer: Spring Boot internals, JVM tuning, thread/heap management, concurrency debugging, optimization
  • Container Runtime: PCF, Docker, container resource limits, CPU throttling, memory pressure
  • Orchestrators: PCF, Kubernetes, ECS (autoscaling, pod health, scheduling issues)
  • Networking: Service-to-service hops, TLS overhead, DNS, routing, load balancer configs (F5, Nginx, ALB/NLB), service mesh performance
  • Storage: Latency, IOPS constraints, distributed file system behavior
  • Caching & Middleware: Redis, Hazelcast, NATS, Kafka, RabbitMQ configuration and throughput tuning
  • Databases: Connection pool tuning, slow queries, indexing, replication lag
  • Cloud Layer: AWS compute/storage/network performance, regional latency, cross-cloud traffic patterns

For applications and inquiries, contact: [email protected] Apply tot his job Apply To this Job

Related roles

Sr. Google Cloud Platform Engineer | Strategic Education, Inc. | Remote (United States)

Remote · USA Full-time

AWS EUC Engineer (WorkSpaces, AD / Entra ID, SSO)

Remote · USA Full-time

Senior Software Engineer (Infrastructure)

Remote · USA Full-time

Cloud Engineer Consultant - Technology & Transformation

Remote · USA Full-time

Cloud Engineering Intern

Remote · USA Full-time

Cloud Solution Engineer 4

Remote · USA Full-time

CNAPP Hybrid Prisma Cloud Engineer (Palo Alto Networks)

Remote · USA Full-time

Cloud System Engineer

Remote · USA Full-time

[Remote] Azure Cloud Engineer - This position supports a U.S. federal government contract that requires U.S. citizenship

Remote · USA Full-time

DevOps & Cloud Engineer (Azure + Backend Enablement)

Remote · USA Full-time

Experienced Part-Time Remote Data Entry Specialist – arenaflex Operations

Remote · USA Full-time

Teacher Special Education Instructor PK-12 - Adjunct Faculty

Remote · USA Full-time

Experienced Remote Data Entry Clerk – Logistics and Administrative Support

Remote · USA Full-time

Junior Quantitative Analyst

Remote · USA Full-time

Staff Frontend Engineer | Web Apps & Platform

Remote · USA Full-time

Motion Designer to Elevate Infographic for Services

Remote · USA Full-time

Experienced Customer Support Service Representative – STAT Team – arenaflex

Remote · USA Full-time

NP/PA for Telehealth Post-Acute Care - Dermatology

Remote · USA Full-time

K–Career AI Education Manager

Remote · USA Full-time

Radiology Scheduler - Work from Home | $16.00/hr | Starts 5/21/26

Remote · USA Full-time