Toronto, Canada Hybrid Employment

Tenstorrent is hiring an Acceleration Kernel Developer Intern

Responsibilities

  • Participate in the design, development, and maintenance of kernel-level software components for our applications.
  • Develop and optimize kernels and kernel libraries for efficient machine learning and HPC applications.
  • Implementation of tensor compute and tensor data movement optimizations kernels
  • Heavy focus on optimizations.
  • Analyze and optimize low-level code to improve the performance and efficiency of our software, with a strong emphasis on tensor optimization.
  • Collaborate with machine learning engineers and data scientists to integrate optimized kernels and low-level routines into machine learning frameworks and pipelines.
  • Identify performance bottlenecks, conduct performance profiling, and develop strategies to address and resolve them.
  • Write comprehensive unit tests, conduct thorough debugging, and ensure the stability and reliability of kernel-level code.
  • Develop infrastructure to analyze functional test and performance test data.
  • Create clear and concise documentation for code, APIs, and best practices to facilitate collaboration within the team.
  • Stay up-to-date with the latest developments in kernel development, tensor optimization, and machine learning to propose innovative solutions and improvements.

Requirements

  • Currently enrolled in a bachelors degree, or a masters or doctorate program in Computer Science, Software Engineering, Computer Engineering, Engineering Science or a related field.
  • A strong understanding of algorithm implementation and low level algorithm optimization.
  • Experience with parallel acceleration development, such as parallel algorithms on CPU, or GPU acceleration.
  • High degree of proficiency in C/C++ and Python programming languages.
  • Familiarity with machine learning frameworks and concepts.
  • Strong problem-solving skills and the ability to analyze and debug complex issues.
  • Experience with debug, performance profiling and optimization tools.
  • Excellent communication and teamwork skills.
  • Self-motivated, detail-oriented, and able to work independently as well as in a team.

Nice to Have

  • Familiarity with infrastructure built with cloud services is a plus (e.g. AWS).

Work Arrangement

Hybrid — Toronto, Ontario

Additional Information

  • This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology.
  • Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2).
  • These requirements apply to persons located in the U.S. and all countries outside the U.S.
  • As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency.
  • If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.
Required Skills
C/C++
About company
Tenstorrent
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. The company has developed a high performance RISC-V CPU from scratch and is focused on building the best AI platform by unifying innovations in software models, compilers, platforms, networking, and semiconductors.
All jobs at Tenstorrent Visit website
Job Details
Category other
Posted 5 hours ago