All roles

Member of Technical Staff (Software Engineer)

Remote · USA Full-time New today

Cerebras Systems is a leader in AI technology, known for building the world's largest AI chip. They are seeking a Member of Technical Staff (Software Engineer) to implement infrastructure for high-performance inference services and collaborate with cross-functional teams to enhance the inference pipeline.

Responsibilities

  • Implement infrastructure to support high-performance, low-latency inference service
  • Deploy and configure Kubernetes services to ensure scalability and reliability of inference workloads
  • Optimize resource allocation and auto-scaling policies to handle variable inference demand while minimizing operational costs
  • Integrate inference services with containerized environments using Docker and Kubernetes for orchestration
  • Ensure high availability and fault tolerance by implementing multi-region deployments and disaster recovery strategies
  • Develop Python-based scripts and APIs to streamline data preprocessing, inference execution, and post-processing for real-time inference tasks
  • Collaborate with machine learning engineers to validate inference accuracy and performance against functional and latency requirements
  • Triage and resolve defects in the service by analyzing logs, metrics, and distributed traces
  • Debug issues related to model deployment, container orchestration, or networking configurations, documenting steps to reproduce and root-cause defects
  • Collaborate with cross-functional teams to address performance regressions, scalability issues, or integration failures in the inference pipeline
  • Develop automated scripts to detect and mitigate common failure modes, improving system reliability
  • Author detailed technical documentation for infrastructure configurations, inference workflows, and APIs, ensuring clarity for internal teams and external customers
  • Work with product management and user experience teams to define requirements for inference service interfaces, including configuration, monitoring, and event logging
  • Document and track defects, enhancements, and release notes using tools like Jira and Git, ensuring version control and traceability
  • Participate in release planning and prioritization discussions to align infrastructure development with customer needs and business objectives

Skills

  • Master's degree or foreign equivalent degree in Computer Science, or a related field and 1 year of experience as Software Developer, Student/Intern (Software Developer), Member of Technical Staff (Software Engineer), Software Engineer, or a related occupation required
  • Docker and Kubernetes
  • Java or C++
  • ActiveMQ and Kafka
  • Python or Groovy
  • JavaScript or TypeScript
  • Linux
  • SQL, OracleDB, and Redis
  • Git

Benefits

  • Telecommuting permitted
  • Job stability with startup vitality
  • Simple, non-corporate work culture that respects individual beliefs
  • Continuous learning, growth and support of those around them

Company Overview

  • Cerebras Systems is the world's fastest AI inference. We are powering the future of generative AI. It was founded in 2015, and is headquartered in Sunnyvale, California, USA, with a workforce of 501-1000 employees. Its website is https://cerebras.ai.
  • Company H1B Sponsorship

  • Cerebras has a track record of offering H1B sponsorships, with 5 in 2026, 31 in 2025, 16 in 2024, 18 in 2023, 17 in 2022, 34 in 2021, 23 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    Related roles

    [Remote] AI Trainer - Automobitive Expertise Required

    Remote · USA Full-time

    [Remote] Croatian Language Speakers

    Remote · USA Full-time

    [Remote] AI Trainer - Advanced Mandarin Fluency (FR)

    Remote · USA Full-time

    [Remote] AI Generalist (No Experience Required) - Freelance AI Trainer Project

    Remote · USA Full-time

    Performance Modeling Engineer ~2

    Remote · USA Full-time

    [Remote] Spanish Language Speaker

    Remote · USA Full-time

    [Remote] Linguistic AI Auditor (Simplified Chinese)

    Remote · USA Full-time

    [Remote] Taiwanese Dialect Specialist - Freelance AI Trainer Project

    Remote · USA Full-time

    [Remote] AI Trainer - Computer Scientist (EST)

    Remote · USA Full-time

    [Remote] Czech Language Speakers

    Remote · USA Full-time

    Experienced Part-Time Evening Remote Data Entry Specialist – Flexible Work Arrangement

    Remote · USA Full-time

    Experienced Part-Time Remote Customer Service Representative – Delivering Exceptional Service to Global Shoppers

    Remote · USA Full-time

    RN – Registered Nurse Navigator Triage – Population Health

    Remote · USA Full-time

    Associate Project Manager Knowledge Content Manager Remote

    Remote · USA Full-time

    Experienced Customer Service Representative – Remote USA Opportunity at arenaflex

    Remote · USA Full-time

    Product Marketing Manager

    Remote · USA Full-time

    Part-Time Medical Transcription Jobs from Home for Healthcare Experience Holders

    Remote · USA Full-time

    Forward Deployed Data Engineer

    Remote · USA Full-time

    Remote Sales Enrollment Specialist - Calexico

    Remote · USA Full-time

    Developer Enablement Content Specialist (PID0637)

    Remote · USA Full-time