Senior Observability Engineer

at FanDuel

FanDuel | New York City | Posted 2026-06-22

Job description

THE POSITION
Our roster has an opening with your name on it

FanDuel is looking for a Senior Observability Engineer to design, build, and mature the observability ecosystem that underpins our platform and services. You will deliver deep visibility into system behavior by combining system telemetry with user signals to provide a holistic view of performance, reliability, and user experience. You’ll also explore how AI and machine learning can enhance observability, from intelligent alerting and anomaly detection to accelerating root cause analysis.

This is a hands-on role. You’ll partner closely with engineering and product teams to deliver scalable observability capabilities, serve as a subject matter expert in monitoring, alerting, and incident management, and equip teams with self-service insights and tooling. By connecting system behavior to real user impact and leveraging AI-assisted workflows to surface issues faster, you’ll drive improvements in reliability, performance, and data-informed decision-making across the organization.

In addition to the specific responsibilities outlined above, employees may be required to perform other such duties as assigned by the Company. This ensures operational flexibility and allows the Company to meet evolving business needs.

THE GAME PLAN
Everyone on our team has a part to play

- Contribute to the observability strategy and roadmap, partnering with multiple teams to align with business priorities and engineering goals.
- Design and enhance scalable observability solutions that provide actionable insights into system health, performance, and user experience.
- Help establish and promote best practices for monitoring, alerting, incident management, and postmortems across teams.
- Support operational excellence by improving incident response processes, on-call practices, and post-incident reviews, focusing on continuous improvement.
- Collaborate on cross-team initiatives to improve system reliability, identifying risks and contributing to their resolution.
- Apply automation and AI-assisted workflows to improve root cause analysis and reduce operational toil.
- Work with engineering and product stakeholders to surface observability insights that inform technical decisions and prioritization.
- Analyze system and user signals to help detect, prevent, and mitigate reliability issues.
- Contribute to optimizing observability platforms for performance, scalability, and cost-efficiency.
- Mentor peers and contribute to raising observability and reliability standards within the team.

In addition to the responsibilities outlined above, employees may be required to perform other duties as assigned by the Company to ensure operational flexibility and meet evolving business needs.

A Sneak Peek Into Our Tech Stack
AWS, Kubernetes, Terraform, Helm, Ansible, Vault, Datadog and PagerDuty

THE STATS
What we're looking for in our next teammate

- Solid hands-on experience in observability engineering, SRE, platform engineering, or related roles, with impact across team-level systems.
- Strong expertise in monitoring and observability practices, with hands-on experience using tools such as Datadog.
- Experience contributing to observability or reliability initiatives across teams or services.
- Proficiency with Kubernetes, cloud infrastructure (e.g. AWS), and infrastructure-as-code tools such as Terraform.
- Ability to influence technical decisions within and across teams, collaborating effectively with a range of stakeholders.
- Good understanding of distributed systems principles (e.g. consistency, availability, partition tolerance) and practical trade-offs.
- Experience defining and implementing SLOs, SLIs, and alerting strategies, including an understanding of user-impacting metrics.
- Strong software engineering fundamentals, with proficiency in at least one modern programming language (e.g. Go, Java, Python, or TypeScript), and experience building tooling, automation, and scalable systems.
- Experience improving systems through automation, helping reduce operational toil and recurring issues.
- Strong analytical and problem-solving skills, with the ability to interpret technical signals and relate them to system performance and reliability.
- Good communication and collaboration skills, with the ability to work effectively with both technical and non-technical stakeholders.
- A sense of ownership and accountability, with a focus on delivering reliable, scalable solutions and continuous improvement.

Don’t check all the boxes? That’s okay! We encourage you to still apply if you feel like you possess an adjacent skill set and are interested in learning more about this position.

ABOUT FANDUEL

FanDuel Group is the premier mobile gaming company in the United States and Canada. FanDuel Group consists of a portfolio of leading brands across mobile wagering including: America’s #1 Sportsbook, FanDuel Sportsbook; its leading iGaming platform, FanDuel Casino; the industry’s unquestioned leader in horse racing and advance-deposit wagering, FanDuel Racing; and its daily fantasy sports product.

In addition, FanDuel Group operates FanDuel TV, its broadly distributed linear cable television network and FanDuel TV+, its leading direct-to-consumer OTT platform. FanDuel Group has a presence across all 50 states, Canada, and Puerto Rico. The company is based in New York with US offices in Los Angeles, Atlanta, and Jersey City, as well as global offices in Canada and Scotland. The company’s affiliates have offices worldwide, including in Ireland, Portugal, Romania, and Australia. FanDuel Group is a subsidiary of Flutter Entertainment, the world's largest sports betting and gaming operator with a portfolio of globally recognized brands and traded on the New York Stock Exchange (NYSE: FLUT).

PLAYER BENEFITS
We treat our team right

We offer amazing benefits above and beyond the basics. We have an array of health plans to choose from (some as low as $0 per paycheck) that include programs for fertility and family planning, me