Full-time

Associate Director, Software Engineering (Model Hosting/Inference Optimisation)

Posted by HSBC Global Services Limited • Shenzhen, Guangdong, China

📍 Shenzhen, Guangdong 🕒 June 18, 2026

About the Role

Some careers have more impact than others.

If you’re looking for a career where you can make a real impression, join HSBC and discover how valued you’ll be.

 

We are currently seeking an experienced professional to join our team in the role of Associate Director, Software Engineering (Model Hosting/Inference Optimisation).

 

Business: CTO Platforms (AI Platforms)

Location: Shenzhen / Guangzhou

Req ID: 44990

 

Principal responsibilities

  • Design, build, and operate scalable, reliable model hosting platforms for LLMs, embeddings, and STT/TTS across heterogeneous hardware. 
  • Drive inference optimisation for latency, throughput, and cost (quantisation, KV-cache optimisation, dynamic/continuous batching). 
  • Evaluate, integrate, and tailor inference frameworks (e.g., vLLM, TensorRT-LLM,...

Ready to Apply?

Submit your application today and take the next step in your career journey with HSBC Global Services Limited.

Apply Now