Full-time

AI Inference & Compression Engineer

Posted by PERSOL APAC • Singapore

📍 Singapore, Singapore 🕒 May 29, 2026


About the Role

About the company:

We have partnered with a renowned global leader in information and communications technology (ICT) infrastructure and smart devices. They provide full-stack, all-scenario solutions, products, and services to carriers, enterprises, governments, and individual consumers worldwide.

Our client is looking for an AI Inference & Compression Engineer to join the team.

Job Overview:
This role focuses on developing high-performance compression and inference techniques across both classical video/media codecs and modern Large Language Model (LLM) inference systems. You will design intelligent pipelines that deliver higher visual quality at lower bitrates, while simultaneously developing algorithms to reduce memory footprint and computational bottlenecks in generative AI serving.

Key Responsibilities
LLM Inference Acceler...

Job Details

Location Singapore, Singapore
Job Type Full-time
Category other-general
Posted May 29, 2026
Deadline July 08, 2026

