★Backend Engineer (Routing & Token Flow, Typescript/Javascript/Rust) | Data Center Services
- English ONLY OK
◆ Start-up Company
◆ Hybrid Work
◆ Own Products/Services
◆ Global Environment
◆ Annual salary: 8 million yen -14 million yen
【About the company】
・Data Center Development Support
・Data Center Maintenance and Operations
・Development and Operation of Cloud Services for Data Centers
【 Job Description】
Role overview
As a Backend Engineer, you will be responsible for the control-plane and gateway layer that connects the customer to our compute serving infrastructure. You will build and power the customer request lifecycle end to end, including handling requests via Cloudflare worker nodes, managing authentication and tenancy, validating requests, routing to the correct model endpoints, enforcing quotas and rate limits, and implementing reliability mechanisms to ensure platform stability under load. This layer is also key to achieving significant service performance latency through optimization. The goal is to build a backend that is fast, secure by default, abuse-resistant, and highly operable at scale. You will collaborate with the inference and platform teams on developing the backend architecture. We expect you to be a central, hands-on contributor to the code stack, driving both the building and technical decision-making as an expert from the ground up. You will work closely with the President and engineering leadership on backend routing decisions that accurately reflect real capacity and failure domains, and ensure the system provides the necessary telemetry for rapid debugging.
Responsibilities:
● Architect Intelligent Routing Logic: Design and implement a dynamic "Intelligent Router" that uses real-time metrics and ML-based scoring to select the optimal GPU Pool for every request. You will ensure efficient GPU utilization and prevent SLA violations by routing traffic based on node health and congestion.
● Implement Model-Based Parsing: Build logic within the Edge Gateway to parse request bodies, identify model parameters, and execute "Model-Based Routing" to direct specific workloads to the appropriate specialized GPU clusters.
● Build Distributed Caching Systems: Engineer a "Distributed KV Cache" strategy that allows GPU pools to share KV caches, significantly reducing duplicate calculations and improving inference speed. You will also manage local edge caches for rate limiting and quota enforcement.
● Optimize Edge Gateway & Security: Leverage Cloudflare Anycast to accept user requests at the edge location nearest to them, solving distance-based latency issues. You will implement security layers to filter malicious requests at the gateway, ensuring they never reach the origin servers.
● High-Throughput Stream Management: Optimize the Token Flow using Cloudflare Network Interconnect, ensuring that inference streams are delivered with minimal jitter and latency (TTFT).
● Protocol & Traffic Engineering: Fine-tune the communication protocols between the Edge and the DC (GPU Pool), handling connection pooling and keep-alives to sustain high throughput.
【 Requirements】
Requirements
● Advanced Edge Engineering: Expert-level TypeScript/JavaScript or Rust/WASM experience specifically within Cloudflare Workers environments. You understand V8 isolates and edge runtime limitations.
● Complex Routing Algorithms: Proven experience building custom load balancers or routing logic You can translate "ML-based scoring" into performant edge code.
● Network Protocol Technical Expertise: Strong grasp of HTTP/2, HTTP/3, WebSockets, and Anycast networking principles to minimize latency.
● Distributed Systems Caching: Experience designing distributed caching mechanisms (Redis, KV stores) where consistency and hit rates are critical for performance.
● ML/Inference Knowledge: Understanding of how LLM KV caches work and how model parameters impact compute requirements.
● Security Engineering: Experience implementing WAF rules or DDoS mitigation logic at the edge.
【Working Time 】
09:00 ~ 18:00
【 Welfare 】
・Transportation Allowance: Partially provided (up to 15,000 yen per month).
・Social Insurance: Health insurance, Employees' Pension, employment insurance, and industrial accident compensation insurance.
・Overtime Allowance: Standard overtime pay provided.
【 Holiday 】
・Annual Holidays: 120 days.
・Work System: Full five-day workweek system.
・Annual Paid Leave: A minimum of 10 days or more provided starting from the 7th month of employment.
-
Kanagawa G TalentWe are looking for a Backend Engineer to join our team. As a Backend Engineer, you will be responsible for the control-plane and gateway layer that connects the customer to our compute serving infrastructure. You will build and power the customer request lifecycle end to end. · A ...
-
Kanagawa G TalentAs a Backend Engineer you will build and power the customer request lifecycle end to end including handling requests via Cloudflare worker nodes managing authentication and tenancy validating requests routing to the correct model endpoints enforcing quotas and rate limits impleme ...