[HW] GPU Subsystem Engineer
부문
R&D
직군
Engineering
직무
HW Engineer
경력사항
경력 5년 이상
고용형태
정규직
근무지
㈜딥엑스대한민국 경기도 성남시 판교역로241번길 20 미래에셋벤처타워 5층, ㈜딥엑스

Frontier of ​On-device ​AI ​Semiconductors

for Everyone, ​Everywhere


About DeepX Co., Ltd.


DEEPX ​is ​a forward-thinking ​Series D startup ​architecting the ​infrastructure ​for the ​Physical ​AI ​era.

By delivering ​the ​world’s most energy-efficient ​NPU ​technology, ​we are solving ​the critical ​power ​and heat ​challenges of ​Generative ​AI to bring ​super-intelligence to ​every device, everywhere.

 

Our global leadership is validated by our record-breaking recognition as a multi-year CES Innovation Award honoree (2024 & 2026) and being named the 2024 Frost & Sullivan Company of the Year in the NPU sector. With an enterprise value approaching 1 trillion KRW, DEEPX offers a unique pre-IPO

opportunity to join a market leader defining the new industry standard for the $70B AI semiconductor market.

 

We are currently scaling toward mass production of our flagship DX-M1 (Samsung 5nm) with over 50 global projects scheduled, while engineering next-generation 2nm solutions to support 100B parameter Large Language Models (LLM) at the edge. ☞ Link

 

★ If you want to be part of world-class innovation? Please talk with us.

★Explore our journey: The DEEPX White Paper ☞ Link


Role Overview

DEEPX is seeking a highly skilled GPU Integration Engineer to join our SoC Development team. In this role, you will be responsible for the seamless integration of GPU IPs into our AI Inference SoCs. You will ensure that the GPU subsystem interacts efficiently with our proprietary NPU, memory controllers, and high-speed interconnects to deliver optimized performance for edge computing and autonomous multimedia applications.


Responsibilities

IP Integration & RTL Design: Lead the integration of high-performance GPU IPs (e.g., ARM Mali, Imagination, or proprietary cores) into the SoC top-level environment.

Interconnect Architecture: Design and optimize data paths between the GPU and the System-on-Chip using AMBA protocols (AXI, ACE, CHI) and Network-on-Chip (NoC) solutions.

Performance Optimization: Analyze GPU throughput and latency to minimize bottlenecks in data-heavy AI and graphics workloads.

System Validation: Conduct RTL-level verification, Lint, CDC (Clock Domain Crossing), and RDC (Reset Domain Crossing) checks to ensure robust hardware stability.

Power & Area Management: Collaborate with the physical design team to optimize the GPU's power consumption (UPF/CPF) and silicon footprint for edge devices.

Hardware-Software Co-design: Work with the firmware and driver teams to define GPU memory maps, interrupt handling, and power-up sequences.


Qualifications

Education: B.S. or M.S. in Electrical Engineering, Computer Engineering, or a related field.

Experience: 5+ years of experience in SoC integration or RTL design specifically involving GPU or complex processor subsystems.

Technical Skills:

  • Expertise in Verilog or SystemVerilog.
  • Deep understanding of AMBA bus architectures (specifically AXI4 and ACE for cache coherency).
  • Familiarity with GPU pipeline architectures and memory management units (IOMMU/SMMU).

Tools: Experience with EDA tools such as Synopsys (VCS, SpyGlass) or Cadence (Xcelium, JasperGold).


Preferred Qualifications

Experience of FPGA Implementation

Graphics Knowledge: Familiarity with graphics APIs (OpenGL ES, Vulkan, or OpenCL) and how hardware accelerates these standards.

Cache Coherency: Experience with hardware-managed cache coherency between CPUs, GPUs, and NPUs.

Advanced Nodes: Proven track record of tape-outs in advanced process nodes (7nm, 5nm, or below).

Post-Silicon Support: Experience in silicon bring-up and debugging GPU-related issues in the lab.

Communication: Strong technical English communication skills for collaborating with global IP vendors.


Recruitment Process

Application Review - (Phone Interview) - Technical Interview - Organizational Culture Fit Interview - CEO Interview - Reference Check / Compensation Discussion

※ The recruitment process may vary depending on the position and application content.

※ Candidates with less than 3 years of experience are required to submit their academic transcripts.


Employment Type

Full-time (3-month probationary period with 100% compensation)


Working Hours

Monday to Friday, 9:00 AM – 6:00 PM (Lunch break: 12:00 PM – 1:00 PM)


Notes

If any false information is found in the application or onboarding documents, the job offer may be withdrawn even after confirmation.

A 3-month probationary period applies after joining, with no reduction in salary or benefits.


Benefits

□ 모든 정규직 입사자에게 연봉 수준의 스톡옵션 부여

□ 최신 사양 장비 및 최고의 근무 환경 제공 (최신 노트북, 높낮이 조절식 스탠딩 데스크, 모니터암, 듀얼모니터 등 제공)

□ 점심식사 + 아침 & 저녁식사 지원

□ 스낵, 아이스크림, 음료 등 사내 카페 무제한 간식 제공

□ 사우나가 포함된 피트니스 비용 지원

□ 연 1회 종합건강검진 지원 (배우자 포함)

□ 생일, 결혼기념일, 크리스마스이브 축하금 지급 및 조기퇴근 제공

□ 설/추석 명절 상여금 지급

□ 축하와 위로를 위한 경조휴가 및 경조금 지원

공유하기
[HW] GPU Subsystem Engineer

Frontier of ​On-device ​AI ​Semiconductors

for Everyone, ​Everywhere


About DeepX Co., Ltd.


DEEPX ​is ​a forward-thinking ​Series D startup ​architecting the ​infrastructure ​for the ​Physical ​AI ​era.

By delivering ​the ​world’s most energy-efficient ​NPU ​technology, ​we are solving ​the critical ​power ​and heat ​challenges of ​Generative ​AI to bring ​super-intelligence to ​every device, everywhere.

 

Our global leadership is validated by our record-breaking recognition as a multi-year CES Innovation Award honoree (2024 & 2026) and being named the 2024 Frost & Sullivan Company of the Year in the NPU sector. With an enterprise value approaching 1 trillion KRW, DEEPX offers a unique pre-IPO

opportunity to join a market leader defining the new industry standard for the $70B AI semiconductor market.

 

We are currently scaling toward mass production of our flagship DX-M1 (Samsung 5nm) with over 50 global projects scheduled, while engineering next-generation 2nm solutions to support 100B parameter Large Language Models (LLM) at the edge. ☞ Link

 

★ If you want to be part of world-class innovation? Please talk with us.

★Explore our journey: The DEEPX White Paper ☞ Link


Role Overview

DEEPX is seeking a highly skilled GPU Integration Engineer to join our SoC Development team. In this role, you will be responsible for the seamless integration of GPU IPs into our AI Inference SoCs. You will ensure that the GPU subsystem interacts efficiently with our proprietary NPU, memory controllers, and high-speed interconnects to deliver optimized performance for edge computing and autonomous multimedia applications.


Responsibilities

IP Integration & RTL Design: Lead the integration of high-performance GPU IPs (e.g., ARM Mali, Imagination, or proprietary cores) into the SoC top-level environment.

Interconnect Architecture: Design and optimize data paths between the GPU and the System-on-Chip using AMBA protocols (AXI, ACE, CHI) and Network-on-Chip (NoC) solutions.

Performance Optimization: Analyze GPU throughput and latency to minimize bottlenecks in data-heavy AI and graphics workloads.

System Validation: Conduct RTL-level verification, Lint, CDC (Clock Domain Crossing), and RDC (Reset Domain Crossing) checks to ensure robust hardware stability.

Power & Area Management: Collaborate with the physical design team to optimize the GPU's power consumption (UPF/CPF) and silicon footprint for edge devices.

Hardware-Software Co-design: Work with the firmware and driver teams to define GPU memory maps, interrupt handling, and power-up sequences.


Qualifications

Education: B.S. or M.S. in Electrical Engineering, Computer Engineering, or a related field.

Experience: 5+ years of experience in SoC integration or RTL design specifically involving GPU or complex processor subsystems.

Technical Skills:

  • Expertise in Verilog or SystemVerilog.
  • Deep understanding of AMBA bus architectures (specifically AXI4 and ACE for cache coherency).
  • Familiarity with GPU pipeline architectures and memory management units (IOMMU/SMMU).

Tools: Experience with EDA tools such as Synopsys (VCS, SpyGlass) or Cadence (Xcelium, JasperGold).


Preferred Qualifications

Experience of FPGA Implementation

Graphics Knowledge: Familiarity with graphics APIs (OpenGL ES, Vulkan, or OpenCL) and how hardware accelerates these standards.

Cache Coherency: Experience with hardware-managed cache coherency between CPUs, GPUs, and NPUs.

Advanced Nodes: Proven track record of tape-outs in advanced process nodes (7nm, 5nm, or below).

Post-Silicon Support: Experience in silicon bring-up and debugging GPU-related issues in the lab.

Communication: Strong technical English communication skills for collaborating with global IP vendors.


Recruitment Process

Application Review - (Phone Interview) - Technical Interview - Organizational Culture Fit Interview - CEO Interview - Reference Check / Compensation Discussion

※ The recruitment process may vary depending on the position and application content.

※ Candidates with less than 3 years of experience are required to submit their academic transcripts.


Employment Type

Full-time (3-month probationary period with 100% compensation)


Working Hours

Monday to Friday, 9:00 AM – 6:00 PM (Lunch break: 12:00 PM – 1:00 PM)


Notes

If any false information is found in the application or onboarding documents, the job offer may be withdrawn even after confirmation.

A 3-month probationary period applies after joining, with no reduction in salary or benefits.


Benefits

□ 모든 정규직 입사자에게 연봉 수준의 스톡옵션 부여

□ 최신 사양 장비 및 최고의 근무 환경 제공 (최신 노트북, 높낮이 조절식 스탠딩 데스크, 모니터암, 듀얼모니터 등 제공)

□ 점심식사 + 아침 & 저녁식사 지원

□ 스낵, 아이스크림, 음료 등 사내 카페 무제한 간식 제공

□ 사우나가 포함된 피트니스 비용 지원

□ 연 1회 종합건강검진 지원 (배우자 포함)

□ 생일, 결혼기념일, 크리스마스이브 축하금 지급 및 조기퇴근 제공

□ 설/추석 명절 상여금 지급

□ 축하와 위로를 위한 경조휴가 및 경조금 지원