
See Salary Ranges
Job Description
As a Software Engineer on the AI Models / System Bring-Up team, you will help bring up, validate, and optimize AI models on Tenstorrent platforms. You will work across models, runtime software, and hardware to turn research workloads into reliable, high-performance systems.
This role sits at the intersection of technical expertise and customer engagement, focused on helping customers and internal teams bring up and optimize AI models on Tenstorrent platforms.
This role is hybrid, based out of Tokyo, Japan.
We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.
Qualifications
Who You Are
- Experience with deep learning models in at least one major framework such as PyTorch, TensorFlow, or JAX.
- Strong Python or C++ skills and good understanding of neural network architectures, training, and inference workflows.
- Comfortable working in Linux and able to debug issues across software, runtime, and hardware.
What We Need
- Bring up and validate AI models such as LLMs, CNNs, recommendation models, and vision models on Tenstorrent hardware and simulators.
- Port models into Tenstorrent toolchains and runtime environments.
- Run experiments to evaluate model accuracy, performance, and stability.
- Debug cross-stack issues and work closely with hardware, compiler, and runtime teams.
- Collaborative and curious, with a degree in Computer Science, Engineering, Applied Mathematics, or a related field, or equivalent practical experience.
- Fluent in English; Japanese proficiency is preferred.
What You Will Learn
- How AI models are mapped and optimized on custom AI accelerators.
- How hardware, compiler, runtime, and model teams work together to build production-ready systems.
- Best practices for model bring-up, automation, regression testing, and performance tuning.
- How to translate real-world model requirements into practical technical solutions.
Nice to Have
- Experience with LLM or foundation model inference, including KV-cache optimization and quantization.
- Background in compiler or runtime engineering for ML workloads.
- Exposure to post-silicon validation, board bring-up, firmware development, or accelerator platforms.
- Experience working directly with customers or field teams on AI workload deployment and debugging.
About the Company
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.