Full-time

ML Software Engineer - Platform LLM Training and Inference

Posted by Rayn Group • Multan, Punjab, Pakistan

📍 Multan, Punjab 🕒 February 28, 2026

About the Role

What you will bring to Rayn as an ML Software Engineer

  • Design and implement core services for the LLM orchestration platform
  • Build unified pipelines for fine-tuning and training methods, including LoRA, QLoRA, and RL-based approaches
  • Integrate and manage multiple LLM runtimes such as vLLM, TensorRT, and llama.cpp
  • Implement GPU-aware scheduling, resource allocation, and workload isolation
  • Optimize VRAM usage, KV-cache management, and inference throughput
  • Build internal APIs and developer tooling for model lifecycle management
  • Collaborate closely with hardware, platform, and research teams

Who We’re Looking For

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  • Strong experience in JavaScript, Python, and C++, with at least 6 years of experience in software development, architecture, and team leadership roles.
  • Solid unders...

Ready to Apply?

Submit your application today and take the next step in your career journey with Rayn Group.
