Responsibilities
- Deploy machine learning models to edge devices using the llama.cpp, ggml, and ONNX frameworks
- Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments
- Integrate AI features into existing products, enriching them with the latest advancements in machine learning
Requirements
- Excellent programming skills in C++
- Strong experience with the llama.cpp and ggml inference engines, which facilitate the deployment of models to specific GPU architectures
- Good understanding of deep learning concepts and model architectures
- Experience with transformers, LLMs, and diffusion models
- Demonstrated ability to rapidly assimilate new technologies and techniques
- A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D
Nice to Have
- Experience with JavaScript