Software Engineer, Acceleration Kernel Development
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. As a Software Engineer on the Acceleration Kernel Development team, you’ll write low-level code that optimizes high-efficiency machine learning workloads, focusing on performance and precision.
Responsibilities
- Writing low-level code that directly powers high-efficiency machine learning workloads, optimizing every cycle, every memory move, every instruction
- Building and optimizing compute kernels for parallel ML and high-performance workloads
- Analyzing and tuning instruction-level performance across latency, memory, and bandwidth
- Collaborating closely with ML engineers to integrate optimizations into production
- Debugging, profiling, and maintaining a fast, reliable low-level software stack
Skills
- Expertise in building and optimizing compute kernels for parallel ML and high-performance workloads
- Ability to analyze and tune instruction-level performance across latency, memory, and bandwidth
- A collaborative mindset to work closely with ML engineers and integrate optimizations into production
- Ownership of debugging, profiling, and maintaining a fast, reliable low-level software stack
- Comfortable in C/C++ and able to build fast, efficient code from the ground up
- Obsessed with performance and precision, especially in tensors and ML workloads
- Motivated by complex problems and thrives in collaborative, fast-moving environments
Benefits
- Highly competitive compensation package
- Benefits
Company Overview
Apply To This Job