14 Views

Silicon Architect, Diffusion ASICs

Published Date: May 28, 2026
Normal Computing Corporation, New York, NY
Job Description:

Normal Computing is at the forefront of developing innovative software and hardware solutions for the semiconductor industry and AI infrastructure. With teams across major global cities, we are focused on creating ASICs specifically designed for image and video diffusion inference, challenging the conventional GPU paradigm. We seek a talented individual to contribute to the architecture and microarchitecture of our AI accelerator compute blocks, ensuring our research translates into tangible silicon solutions.

Responsibilities:

  • Define the architecture and microarchitecture of novel AI accelerator compute blocks, focusing on PE array design and efficiency techniques.
  • Translate workload analysis and research findings into clear hardware specifications for implementation by RTL engineers.
  • Evaluate PPA tradeoffs across the architecture stack, making informed decisions based on incomplete data.
  • Collaborate with the compiler lead on ISA co-design to ensure the compute tile is both compilable and programmable.
  • Oversee FPGA prototyping to validate architecture decisions and drive implementation through to bring-up.
  • Stay updated on AI accelerator research and articulate Normal's unique approach in comparison to existing solutions.

Qualifications:

  • Degree in Electrical Engineering, Computer Engineering, Computer Science, or equivalent experience; PhD preferred but not required.
  • Substantial experience in architecture or microarchitecture of high-performance digital systems, particularly AI accelerators or complex logic.
  • Fluency in translating algorithm-level analysis into hardware specifications and vice versa.
  • Experience with simulation-driven architecture and cycle-accurate models for design decisions.
  • Familiarity with quantization and reduced-precision approaches for inference and their hardware implications.
  • Experience writing microarchitecture specifications and collaborating with RTL engineers during implementation.

Skills:

  • Proficiency in Python or C++ for performance modeling and analysis.
  • Familiarity with SystemVerilog or equivalent RTL.
  • Strong analytical skills to navigate between algorithm-level behavior and hardware design.
  • Ability to work in a dynamic environment where architecture is actively being developed.

Recent Stories


Logo Image
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.