Tokyo Rakuten
Job Description

Business Overview


The Technology Platforms Division (TPD) drives the growth of Rakuten's ecosystem by delivering innovative, high-quality technology platforms characterized by integrated control and strategic partnerships and responsible for building and operating the infrastructure and ecosystem platforms which power the Rakuten Group.

Department Overview


Our department, BSS Ops Department (BSOPD) provides operational service for BSS applications both B2B & B2C and also responsible for maintenance of IT infra (on-premise and cloud environment) for BSS platform.

Position

Why We Hire


We are looking for Entrepreneurial, Innovative, Growth-Oriented, and Customer-obsessed individuals to join our growing team to build the Telco of the Future.


We are a truly global organization, with team members from Japan, India, North America, South America, Europe, China, Korea, Australia, Africa, and more, shifting to a fast-paced, agile way of working.

Position Details

Ensure high availability, resilience, and scalability across multi-region production environments through automation and proactive monitoring.
Design and maintain CI/CD pipelines (Jenkins, GitLab CI, ArgoCD) to enable continuous delivery for microservice and portal components.
Build and operate observability frameworks (metrics, logs, and traces) using Dynatrace, Grafana, Prometheus, Splunk, and Kibana.
Develop and enhance infrastructure-as-code templates (Terraform, Ansible) to manage cloud and on-premise resources consistently.
Participate in the on-call rotation for critical incidents, lead service restoration, and perform detailed Root Cause Analyses (RCA).
Collaborate with development, product, and network teams to optimize system performance and stability across Rakuten's digital ecosystem.
Implement and track SLOs, SLIs, and SLAs for all critical services to improve reliability and align with business objectives.
Contribute to post-incident reviews, drive automation for recurring issues, and continuously enhance system resilience.
Create and maintain runbooks, dashboards, and knowledge base documentation for operational readiness and training.
Support regular maintenance, feature rollouts, and security patching for production and pre-production environments.

Mandatory Qualifications

Technical Expertise

Cloud Platforms:

Extensive hands-on experience with AWS and/or Rakuten Cloud Platform (RCP) services (e.g., EC2, EKS, S3, IAM, VPC, Route 53).


Containerization & Orchestration:
Strong expertise with Docker, Kubernetes (K8s), and Helm for deploying, scaling, and managing distributed, microservice-based applications. Experience with Helm charts, ConfigMaps, and Secrets management.
Infrastructure as Code (IaC): Proficiency with Terraform, CloudFormation, or Ansible for automated infrastructure provisioning, configuration management, and drift detection.

CI/CD Automation:

Deep knowledge and hands-on experience designing and implementing automated build and deployment pipelines using Jenkins, GitLab CI/CD, and ArgoCD.

Familiarity with Git branching strategies, artifact management (Nexus, Artifactory), and code quality gates (SonarQube). Experience with blue-green and canary deployment strategies.

Monitoring & Observability:

Expert-level experience with Dynatrace, Grafana, Prometheus, ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, and/or New Relic for full-stack visibility, metrics collection, alerting, and dashboard creation.


Logging & Tracing:
Skilled in centralized logging and distributed tracing tools such such as Dynatrace, New Relic, AppDynamics, Jaeger, or OpenTelemetry. Strong understanding of end-to-end observability for diagnosing complex issues.

Scripting & Automation:

Strong proficiency in Python, Shell (Bash), or Go for developing automation scripts, health checks, self-healing mechanisms, and reliability tools.


Operating Systems:
Expert in Linux/Unix administration, including performance tuning, troubleshooting, and security hardening.

Networking & Security:
Solid understanding of TCP/IP, DNS, load balancing, TLS/SSL, firewalls, and identity management (e.g., OAuth2, SSO).

Incident Management:

Proven experience in handling P1/P2 incidents, leading service restoration, performing detailed Root Cause Analyses (RCA), and implementing preventive measures.


Version Control & Collaboration:
Proficient in Git, Bitbucket, and agile collaboration tools like JIRA and Confluence.
Domain & Methodological Knowledge

Telecom BSS/OSS Systems:
Strong understanding of Rakuten's customer-facing portals, CRM, order workflows, and the broader telecommunications BSS/OSS landscape.

Site Reliability Engineering (SRE): Ability to define and monitor SLOs, SLIs, and SLAs to ensure service reliability and uptime targets.

Familiarity with SRE best practices (e.g., Google SRE model) and error budget management.

Hybrid/Multi-Cloud:
Experience managing Kubernetes clusters and deploying applications in hybrid cloud or multi-cloud environments (AWS EKS, Rakuten Cloud Platform).

Cost Optimization & Capacity Planning:
Experience with cost optimization strategies and capacity planning in cloud environments.

IT Governance:
Familiarity with ITIL and ISO 27001 standards.
Professional Competencies

Problem-Solving:
Exceptional analytical and troubleshooting capabilities to resolve complex, time-sensitive issues efficiently.

Communication:

Excellent verbal and written communication skills to articulate technical issues to both technical teams and non-technical stakeholders (e.g., business users, L1 support).


Adaptability:

The ability to quickly learn and adapt to new front-end technologies, frameworks, and evolving business processes within a dynamic environment.


Customer Focus:
A strong commitment to ensuring a positive and efficient user experience for both customers and internal agents.
Experience & Education
Bachelor's degree in Computer Science, Information Technology, or a related technical field.

Typically 8 to 12 years of experience in an L3 or equivalent technical support role, ideally within the telecommunications sector.

Proven experience with ITSM methodologies and ticketing tools such as ServiceNow or Jira.

Desired Qualifications

Proactive approach to problem-solving.
Strong organizational skills & Experience with budget management.
Knowledge of industry standards and compliance requirements.
Ability to work independently and as part of a team.
Commitment to continuous learning and professional development.

Other Information

Additional information on Location

Rakuten Crimson House (Head office)

#engineer #developmentsupport #technologyplatformdiv

Languages

English (総合 - - 流暢)
Show more Show less

  • Tokyo Rakuten

    The Technology Platforms Division (TPD) drives the growth of Rakuten's ecosystem by delivering innovative technology platforms. · We are looking for entrepreneurial individuals to join our growing team to build the Telco of the Future. · ...


  • Tokyo Rakuten

    + Job summary: Business Overview · The Technology Platforms Division (TPD) drives the growth of Rakuten's ecosystem by delivering innovative, high-quality technology platforms characterized by integrated control and strategic partnerships and responsible for building and operatin ...


  • Tokyo Rakuten

    We are looking for Entrepreneurial Innovative Growth-Oriented and Customer-obsessed individuals to join our growing team to build the Telco of the Future. · We are a truly global organization with team members from Japan India North America South America Europe China Korea Austra ...


  • Greater Tokyo Area Guha Edge Professional Consulting Services

    Join our Technology Platforms Division to help operate and scale critical BSS platforms supporting both B2B and B2C services. · ...


  • Tokyo Rakuten

    + · We are looking for Entrepreneurial, Innovative, Growth-Oriented, · and Customer-obsessed individuals to join our growing team · to build the Telco of the Future., Lead and personally drive resolution of the most complex · & critical production incidents (multi-system failures ...


  • Tokyo Rakuten

    We are looking for Entrepreneurial, Innovative, Growth-Oriented and Customer-obsessed individuals to join our growing team to build the Telco of the Future. · Lead and personally drive resolution of the most complex and critical production incidents (multi-system failures signifi ...


  • Tokyo Rakuten

    We are looking for Entrepreneurial, Innovative, Growth-Oriented, and Customer-obsessed individuals to join our growing team to build the Telco of the Future. · Providing proactive system monitoring, · initial alert handling, · and first-level operational assistance. · ...


  • Tokyo SMALL WORLD / Work in Japan?

    DevOps & Observability Platform Engineer (L2 Support) - Telecom BSS. · Ensure operational excellence for internal DevOps and Observability platforms through proactive monitoring, alert handling, and initial troubleshooting. · ...


  • Tokyo SMALL WORLD / Work in Japan?

    DevOps & Observability Platform Engineer (L2 Support) - Telecom BSS. · ...

  • DevOps Engineer

    7日前


    Tokyo Michael Page

    Operate and optimize Kubernetes-based DevOps and observability platforms. · Troubleshoot incidents, · perform root cause analysis, · and automate operations. · ...


  • Tokyo Oracle ¥8,000,000 - ¥15,000,000 per year

    We are seeking an experienced QA Lead to lead and oversee the quality assurance processes for our Telecom BSS/OSS projects with a focus on Oracle BSS applications. · Lead mentor and manage a team of QA engineers to ensure successful execution of test plans and activities. · Deleg ...


  • Tokyo Rakuten

    Technical Engineer provides expert day-to-day technical support for critical BSS billing applications (Rating, Mediation, Payment, Collection). Diagnose and resolve complex issues impacting billing accuracy. · ...


  • Tokyo Rakuten

    +We deliver agile, scalable solutions across the customer lifecycle and continuously enhance system performance through close collaboration with stakeholders. · +System development and operations of business systems for our voice (phone) services, such as application receiving, m ...


  • Tokyo Rakuten

    Deliver agile, scalable solutions across the customer lifecycle and continuously enhance system performance through close collaboration with stakeholders. · System development and operations of business systems for our voice (phone) services, such as application receiving, mobile ...


  • Tokyo Rakuten

    We are managing and evolving the Business Support Systems (BSS) platform. · The main functions that BSS provides are:end-customer touchpoints, · billing and integration with the core systems of Rakuten Mobile. · ...


  • Tokyo Rakuten

    The Technology Platforms Division (TPD) drives the growth of the Rakuten Ecosystem by delivering innovative, high-quality technology platforms characterized by integrated control and strategic partnerships. · We deliver agile, scalable solutions across the customer lifecycle and ...


  • Tokyo SMALL WORLD / Work in Japan?

    Lead the resolution of critical incidents in the Order-to-Activate domain for Rakuten Mobile's BSS platform. · ...


  • Tokyo Rakuten

    We deliver agile scalable solutions across the customer lifecycle and continuously enhance system performance through close collaboration with stakeholders. We are managing and evolving the Business Support Systems (BSS) platform a critical backbone of Rakuten Mobile services. ...

  • Product Manager

    2週間前


    Tokyo Rakuten

    The Business Support System Product Management Department (BSPMD) is responsible for providing a unified product strategy for the Business Support System (BSS) that integrates with the Rakuten ecosystem to maximize Rakuten Mobile's business potential. · Clarify system requirement ...


  • Tokyo Boston Consulting Group (BCG)

    BCG Xは、グローバルな戦略コンサルティングファームであるボストン コンサルティング グループのテクノロジーとデジタル、デザインの専門家集団で、テクノロジーやデータを駆使したビジネス、およびプロダクトビルディングを担います。 · ...


  • Tokyo Rakuten

    This is a job description for a QA Engineer, Team Lead position at Rakuten Mobile. The role involves leading a QA team and managing BSS software development projects. · Lead a QA team and manage BSS software development projects as a QA Engineer. · Identify Kaizen (improvement) a ...