Full-time
Backend Engineer Routing Token Flow
Posted by Fidel Consulting KK • Tokyo ( Remote ) , Tokyo ( Remote ) , Japan
About the Role
Appealing points:
Work at the edge on ultra-low-latency systems, optimizing inference traffic, token flow, and routing logic for real-time AI workloads.
Solve deep technical challenges across networking, distributed caching, and protocol optimization using Cloudflare Workers, Anycast, and modern web protocols.
High-impact role combining edge security, ML inference awareness, and performance engineering, where your work directly protects systems and improves user experience at scale.
Annual Salary: 8 Million yen and Above
Job Responsibilities:
Architect Intelligent Routing Logic: Design and implement a dynamic "Intelligent Router" that uses real-time metrics and ML-based scoring to select the optimal GPU Pool for every request. You will ensure efficient GPU utilization and prevent SLA violations by routing traffic based on node health and congestion.
Implement Model-Based Parsing: Build logic within the Edg...
Ready to Apply?
Submit your application today and take the next step in your career journey with Fidel Consulting KK .
Apply Now