Senior Specialist Field Engineer - Compute Infrastructure

at CoreWeave

Livingston, NJ / New York, NY / Sunnyvale, CA / San Francisco, CA / Bellevue, WA / Dallas, TX

Posted 2026-06-11

Job description

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at www.coreweave.com.

What You'll Do:

The Field Engineering organization at CoreWeave is dedicated to ensuring every customer running AI workloads at scale has a seamless, reliable, and high-performance experience. This team supports the infrastructure that powers the AI revolution, working across data centers, hardware systems, and customer workloads to maintain the integrity of our cloud platform. Field Engineering aligns closely with internal and customer engineering teams, offering valuable insights from the field and the chance to shape the CoreWeave product roadmap and development.

About the role:

As a Specialist Field Engineer - Compute Infrastructure at CoreWeave, you'll own the technical path for some of our largest customers as they go from facility and rack design to a validated, production-ready supercomputer. Working alongside the teams that build and operate each layer, you are the deep technical expert who turns raw data center hardware (racks, GPUs, high-speed fabric, firmware) into reliable compute that customers can use for training and inference at scale, spanning infrastructure engineering, provisioning, validation, operations, and support.

You'll engage hands-on across the entire customer lifecycle: leading new GPU cluster bring-up and acceptance, driving InfiniBand/RoCE fabric validation and HPC performance benchmarking, defining how we operate customer bare-metal fleets at rack-level-and-up (IT service, break-fix, network, and firmware), and standing up locked-down, security-sensitive environments for our most strategic AI customers. You'll partner closely with Data Center Operations, Fleet Operations, Networking, and Product Engineering, and your work in the field will directly shape how CoreWeave delivers compute infrastructure.

If you're driven by innovation, thrilled by the possibilities of what specialized compute can enable, and eager to be part of a team that's shaping the future, then CoreWeave is the place for you. Join us and let's embark on this adventure together!

In this role, you will:

Serve as the primary technical point of contact for customers, establishing strong technical relationships and ensuring their success with CoreWeave's cloud infrastructure offerings, focusing on bare-metal compute infrastructure and end-to-end cluster delivery within high-performance compute (HPC) environments.
Own the technical path from facility and rack design to a validated, production-ready supercomputer, spanning logical design, infrastructure engineering, provisioning, validation, operations, and support.
Lead bring-up and acceptance of new large-scale GPU clusters, driving InfiniBand/RoCE fabric validation, HPC performance benchmarking (e.g., NCCL, ib_write_bw), and remediation of fabric, optics, firmware, and node-level issues to meet customer performance targets.
Define and operationalize models for managing customer bare-metal fleets at rack-level-and-up (IT service, break-fix, network and firmware management), including Bare Metal as a Service (BMaaS) and customer self-service patterns.
Partner with Data Center Operations, Fleet Operations, and Networking teams to align facility, hardware, and fabric readiness with customer go-live timelines and operational SLAs.
Review and advise on customer-facing technical contract terms, including service scope, operational responsibilities, SLAs, isolation requirements, and support boundaries.
Lead proof of concept initiatives to showcase the value and viability of CoreWeave's solutions within specific environments.
Drive technical leadership and direction during customer meetings, presentations, and workshops, addressing any technical queries or concerns that arise.
Act as a virtual member of CoreWeave's Compute Infrastructure, Fleet Operations, and Networking engineering teams, identifying opportunities for product enhancement and collaborating with engineers to implement your suggestions.
Offer valuable insights on product features, functionality, and performance, contributing regularly to discussions about product strategy and architecture.
Stay informed of the latest developments and trends in Kubernetes, cloud computing and infrastructure, sharing your thought leadership with customers and internal stakeholders.
Lead the prototyping and initiation of research and development efforts for emerging products and solutions, delivering prototypes and key insights for internal consumption.
Represent CoreWeave at conferences and industry events, with occasional travel as required.

Who You Are:

B.S. in Computer Science or a related technical discipline, or equivalent experience
7+ years of proven experience as a Solutions Architect, Field Engineer, Infrastructure/Systems Engineer, or Technical Account Manager in Cloud Infrastructure, focusing on building or operating distributed systems or HPC/cloud services, with expertise in bare-metal compute infrastructure and large-scale GPU cluster delivery
Fluency in cloud computing concepts, architecture, and technologies with hands-on experience in designing and implementing cloud solutions
Proven track record of building customer relationships, communicating clearly, and breaking down complex technical concepts for both technical and non-technical audiences
Deep expertise with modern rack-scale GPU server hardware (e.g., NVIDIA HGX / GB200-class systems), high-speed interconnects (InfiniBand, NVLink), and the firmware/BMC/BIOS layer
Expert-level Linux system admini
