Full-time

RL Engineer: LLMs & Code Gen - Hybrid

Posted by Code Metal • Boston, Davao Oriental, Philippines

📍 Boston, Davao Oriental 🕒 June 08, 2026

About the Role

Code Metal, in Boston, Davao Oriental, Philippines, is seeking an engineer whose work bridges production and research in AI. You will build distributed training systems in PyTorch and develop scalable data curation pipelines.

The ideal candidate has strong expertise in reinforcement learning and will engage with frontier research to apply RLHF to large language models, particularly for code generation tasks. Benefits include comprehensive health care, a 401(k) with matching, and a flexible hybrid work arrangement.

Ready to Apply?

Submit your application today and take the next step in your career journey with Code Metal.
