Full-time

Backend Engineer, Routing and Token Flow

Posted by Fidel Consulting KK • Tokyo (Remote), Japan

📍 Tokyo (Remote) 🕒 February 28, 2026

About the Role


Highlights:


Work at the edge on ultra-low-latency systems, optimizing inference traffic, token flow, and routing logic for real-time AI workloads.
Solve deep technical challenges across networking, distributed caching, and protocol optimization using Cloudflare Workers, Anycast, and modern web protocols.
Take on a high-impact role combining edge security, ML inference awareness, and performance engineering, where your work directly protects systems and improves user experience at scale.


Annual Salary: 8 million yen and above

Job Responsibilities:


Architect Intelligent Routing Logic: Design and implement a dynamic "Intelligent Router" that uses real-time metrics and ML-based scoring to select the optimal GPU Pool for every request. You will ensure efficient GPU utilization and prevent SLA violations by routing traffic based on node health and congestion.
Implement Model-Based Parsing: Build logic within the Edg...
