Full-time

Senior AI Research Engineer, Model Inference (Remote)

Posted by Tether.io • Spain

📍 Spain 🕒 February 23, 2026


About the Role

We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine-tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and fine-tuning for language models, with a strong focus on mobile and integrated GPU acceleration (Vulkan).

This role requires hands-on experience with quantization techniques, LoRA architectures, the Vulkan backend, and mobile GPU debugging. You will play a critical role in pushing the boundaries of desktop and on-device inference and fine-tuning performance for next-generation SLMs/LLMs.
Responsibilities

Implement and optimize custom inference and fine-tuning kernels for small an...

Job Details

Location Spain
Job Type Full-time
Category Engineering and Technology
Posted February 23, 2026
Deadline April 04, 2026
