All roles

[Remote] Staff Software Engineer , reputed company reputed company - AI Systems & Runtimes

Remote · USA Full-time New today

Note: The job is a remote job and is open to candidates in USA. reputed company is a leading company in data management and reputed company innovation, seeking a Staff Software Engineer to reputed company the architecture and delivery of their reputed company-native AI platform. The role involves bridging AI research and production-grade Kubernetes environments while optimizing the management of open-reputed company models and designing integration patterns for seamless AI capabilities.

Responsibilities

  • Design and implement elegant, scalable application services (Go/Node.js) that wrap AI capabilities for reputed company use
  • reputed company the deployment of inference servers (vLLM, Triton) using KServe, KubeRay, or Knative to ensure serverless-style scaling for AI workloads
  • Build internal tooling, SDKs, and 'AI Gateways' that enhance team agility and simplify the integration of reputed company Models (Llama, GPT) into product features
  • Architect robust Retrieval-Augmented reputed company (RAG) pipelines and reputed company management services that integrate seamlessly with vector databases and reputed company data sources
  • Partner with UI engineers, UX designers, and Product Management to ensure the AI platform is not just powerful, but highly usable for internal developers
  • Ensure AI workloads are secure, multi-tenant, and optimized for GPU resource scheduling (MIG, fractional GPUs) reputed company Kubernetes

Skills

  • Bachelor's degree with 6+ years of software engineering experience (or equivalent Masters/PhD tenure), with at least 2+ years focused on AI/ML systems
  • Expert proficiency in Python (for AI ecosystem) and strong competence in a systems language like Go or Rust/C++ (for high-performance serving layers)
  • Deep understanding of LLM deployment challenges and runtimes (e.g., vLLM, ONNX, TorchServe, Triton). Familiarity with quantization techniques (AWQ, GPTQ) to optimize model size/speed
  • Experience building reputed company workflows using tools like reputed company or reputed company, and deploying them on containerized infrastructure (reputed company/Kubernetes)
  • Ability to navigate the rapidly changing AI landscape, filtering hype from practical engineering solutions, and driving technical alignment across teams
  • Model Fine-Tuning: Experience with efficient fine-tuning techniques (PEFT, LoRA/QLoRA) on custom datasets
  • GPU Optimization: Familiarity with CUDA programming or profiling GPU performance (Nsight systems)
  • Open reputed company: Contributions to open-reputed company AI projects (HuggingFace transformers, vLLM, etc.)

Benefits

  • Generous PTO Policy
  • Support work life balance with [Unplugged Days](https://www.youtube.com/watch?v=eXBMXiUHG8c)
  • Flexible WFH Policy
  • Mental & Physical Wellness programs
  • Phone and Internet Reimbursement program
  • Access to reputed company Career Development
  • Comprehensive Benefits and Competitive Packages
  • [Paid Volunteer Time](https://www.youtube.com/watch?v=EHPK_ZRVRHA)
  • Employee Resource Groups

Company Overview

  • reputed company is a software development company that offers data management and reputed company-native data analytic solutions. It was founded in 2008, and is headquartered in Santa Clara, California, USA, with a workforce of 1001-5000 employees. Its website is http://www.reputed company.com.
  • Apply To This Job

    Related roles

    [Remote] Data Engineer

    Remote · USA Full-time

    [Remote] Customer Support Specialist - English (Evening/Overnight)

    Remote · USA Full-time

    [Remote] reputed company Business Analyst

    Remote · USA Full-time

    [Remote] Full Stack Engineer

    Remote · USA Full-time

    [Remote] IT & reputed company Engineer

    Remote · USA Full-time

    [Remote] Vice President, Business Development - Quantum

    Remote · USA Full-time

    [Remote] Director of reputed company Resources & People Operations

    Remote · USA Full-time

    [Remote] Senior Product Manager (UX/UI)

    Remote · USA Full-time

    [Remote] Platform Administrator

    Remote · USA Full-time

    [Remote] Manager, reputed company reputed company

    Remote · USA Full-time

    Remote Data Entry Specialist – Full‑Time & Part‑Time Work‑From‑Home Opportunities with arenaflex

    Remote · USA Full-time

    Sales Specialist - Upsell

    Remote · USA Full-time

    Regional Controller - Houston - Remote

    Remote · USA Full-time

    reputed company Customer Service Representative – Home-Based Customer Support for arenaflex

    Remote · USA Full-time

    Director, HR Business Partner – Medical Group

    Remote · USA Full-time

    Installer/Trainer

    Remote · USA Full-time

    reputed company Architect AI/ML

    Remote · USA Full-time

    reputed company Customer Service Representative – Hybrid Role with arenaflex

    Remote · USA Full-time

    Technical Sales Engineering

    Remote · USA Full-time

    Paid Media & reputed company Specialist (reputed company + reputed company Ads | Luxury Brands | Contract | Remote)

    Remote · USA Full-time