About the role
<div class="content-intro"><p>Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.</p></div><p>As a Software Engineer on the Metal Runtime team at Tenstorrent, you’ll work on the low-level software that powers our AI accelerators. You’ll build and optimize high-performance runtime systems that execute directly on the hardware, focusing on scheduling, memory movement, and efficient execution across massively parallel processors. We believe runtime systems are a core part of performance: they determine how hardware resources are utilized, how data flows through the system, and how efficiently workloads are executed at scale.</p> <p>If you enjoy pushing performance, working close to the metal, and solving complex systems challenges at the hardware/software boundary, this is your kind of role.</p> <p>This role is hybrid, based out of Santa Clara, CA; Austin, TX; Toronto, ON.</p> <p>We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.</p> <p>&nbsp;</p> <p><strong>Who You Are</strong></p> <ul> <li>Strong coder in C or C++, and love working close to hardware.</li> <li>Comfortable thinking in terms of threads, processors, and memory movement, and experienced writing fast, parallel code across multiple processors.</li> <li>Enjoy optimizing execution, scheduling, and memory systems for performance at scale.</li> <li>Comfortable debugging at the hardware/software boundary and reasoning about system behavior end-to-end.</li> <li>Curious about how runtimes and bare-metal software come together to power AI.</li> <li>Collaborative by nature and enjoy solving tough problems with smart people.</li> </ul> <p>&nbsp;</p> <p><strong>What We Need</strong></p> <ul> <li>Build and optimize the Metal runtime that executes directly on our AI accelerators.</li> <li>Develop bare-metal systems that manage compute execution, scheduling, and resource utilization.</li> <li>Optimize memory movement and data flow across massively parallel hardware.</li&