AI Frameworks Engineer – GPU Performance for Generative AI (OpenVINO)
at Intel
Want this job?
Let DoneWithWork tailor your resume to this exact posting, write the cover letter, and submit the application for you.
Apply with DoneWithWork — $19.99/moJob description
Job Details:Job Description: Role OverviewWe are seeking a software engineer to drive the implementation and performance optimization of generative AI workloads on Intel GPUs as part of the OpenVINO GPU team.This role focuses on building high-performance, HW-aware software that enables efficient execution of AI models on current and future Intel GPU architectures. You will work across multiple layers of the stack—AI models, runtime systems, and GPU hardware—and take ownership of complex performance problems that require deep technical insight and careful trade-off analysis.You will work on state-of-the-art AI models that push the limits of GPU performance. Your work directly impacts real-world AI performance experienced by developers and customers.About OpenVINOOpenVINO(https://github.com/openvinotoolkit/openvino) is a performance-focused AI inference runtime designed to efficiently execute deep learning models across Intel architectures.The GPU plugin is a core component of OpenVINO that bridges high-level AI models and low-level GPU execution, covering areas such as graph transformation, kernel dispatch, memory management, and hardware-specific optimizations.The codebase is performance-critical, largely written in modern C++, and requires strong understanding of system-level software design, debugging, and optimization.What You Will DoTake technical ownership of performance-critical paths for generative AI workloads (e.g., LLMs, diffusion models) on Intel GPUsAnalyze end-to-end execution of AI models to identify compute, memory, bandwidth, and parallelism bottlenecksImplement and optimize generative AI techniques, adapting state-of-the-art ideas to efficiently run on Intel GPU architecturesTranslate deep understanding of GPU hardware architecture into efficient, scalable, and maintainable software designsOptimize workloads for both current and future Intel GPU platforms, including hardware that is still under developmentDiagnose and resolve complex issues that span runtime, kernel, driver, and hardware boundariesCollaborate with global teams across software, hardware architecture, and validation to deliver optimized solutionsQualifications:Required QualificationsComputer science, computer engineering, or a related field with 3+ years of professional software engineering experienceStrong programming skills in C and C++; working experience with PythonExperience working with large and complex C++ codebases, with attention to performance, correctness, and maintainabilityProven analytical thinking and strong problem-solving abilities, especially for ambiguous or under-specified problemsPreferred QualificationsExperience with GPU programming or parallel computing, such as multi-threading, SIMD, or accelerator programming modelsStrong understanding of computer and GPU architecture, and how hardware characteristics impact software performanceTechnical understanding of generative AI models from a system and performance perspectiveFamiliarity with AI runtimes or frameworksSolid foundation in computer science fundamentals, including data structures, algorithms, and operating systemsAbility to communicate technical ideas clearly in written and spoken EnglishWork ModelThis role follows a structured hybrid work model. The team regularly combines remote work and in-office collaboration, with a designated in-office days each week, while the remaining days are remote. Job Type:Experienced HireShift:Shift 1 (Korea, Republic of)Primary Location: South Korea, SeoulAdditional Locations:Business group:The Software Team drives customer value by enabling differentiated experiences through leadership AI technologies and foundational software stacks, products, and services. The group is responsible for developing the holistic strategy for client and data center software in collaboration with OSVs, ISVs, developers, partners and OEMs. The group delivers specialized NPU IP to enable the AI PC and GPU IP to support all of Intel's market segments. The group also has HW and SW engineering experts responsible for delivering IP, SOCs, runtimes, and platforms to support the CPU and GPU/accelerator roadmap, inclusive of integrated and discrete graphics.Posting Statement:All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.Position of TrustN/AWork Model for this RoleThis role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site. * Job posting details (such as work model, location or time type) are subject to change.*ADDITIONAL INFORMATION: Intel is committed to Responsible Business Alliance (RBA) compliance and ethical hiring practices. We do not charge any fees during our hiring process. Candidates should never be required to pay recruitment fees, medical examination fees, or any other charges as a condition of employment. If you are asked to pay any fees during our hiring process, please report this immediately to your recruiter.
Want this job?
Let DoneWithWork tailor your resume to this exact posting, write the cover letter, and submit the application for you.
Apply with DoneWithWork — $19.99/mo