Tokyo Amazon

Description
Join the Chaos Engineering team in Amazon Search. We perform experiments in production to harden Search against outages and make sure that whenever a customer searches for products, they find what they are looking for.

In This Role You Will

  • Design, implement, execute, and automate chaos experiments to continuously test Amazon Search' resilience against hardware failures, dependency outages, traffic spikes and more.
  • Collaborate with service owners to remedy vulnerabilities, minimize blast radius and harden Amazon Search.
  • Research tools and practices in resilience engineering and adopt them as appropriate.

Joining this team, you'll experience the benefits of working in an entrepreneurial environment, while leveraging the resources of (AMZN), one of the world's leading internet companies. We are a diverse, customer-obsessed and passionate team located in Meguro, Tokyo.

Key job responsibilities

  • Develop and maintain our chaos experiment orchestrator
  • Design, execute, automate, and maintain chaos experiments
  • Develop and maintain our distributed load generator
  • Develop and maintain our petabyte-scale log archival and query service
  • Join a 12/12 on-call rotation for incident response and mitigation

Basic Qualifications

  • Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, Rust

Preferred Qualifications

  • Experience with Linux/Unix
  • Experience in networking, storage systems, operating systems and hands-on systems engineering
  • Experience with distributed operational health and performance monitoring systems

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.

Company
- Amazon Japan G.K.

Job ID: A3154855



  • Tokyo Amazon

    Join the Chaos Engineering team in Amazon Search to perform experiments in production to harden Search against outages. · Design implement execute automate chaos experiments to continuously test Amazon Search' resilience against hardware failures dependency outages traffic spikes ...


  • Tokyo Amazon Full time

    We perform experiments in production to harden Search against outages and make sure that whenever a customer searches for products, they find what they are looking for. · ...


  • Tokyo Amazon

    Join the Chaos Engineering team in Amazon Search. · Design, implement, execute, and automate chaos experiments to continuously test Amazon Search' resilience against hardware failures... · Collaborate with service owners to remedy vulnerabilities... · ...

  • Cloud Data Engineer

    2週間前


    Greater Tokyo Area Randstad Japan

    Randstad is partnered with a leading Life Insurance firm in their search for an experienced Cloud Data Engineer / SRE with specialization in Data projects. · ...


  • Greater Tokyo Area Randstad Japan

    Randstad is partnered with a leading Life Insurance firm in their search for an experienced Cloud SRE with specialization in Data projects. · ...

  • Storage Engineer

    2ヶ月前


    Tokyo SMALL WORLD / Work in Japan? ¥3,500,000 - ¥6,500,000 per year

    A Storage Engineer is responsible for designing implementing maintaining and optimizing storage infrastructure ensuring high availability performance security supporting critical business applications data retention strategies. · ...


  • Tokyo Treasure Data

    Oversee our Japan-based Site Reliability Engineering team to ensure availability latency performance efficiency change management monitoring emergency response and capacity planning. · Manage a team of 5-8 Site Reliability Engineers by setting clear expectations and providing con ...


  • Greater Tokyo Area Randstad Japan

    The candidate will be responsible for architecting, developing and deploying solutions to automate data pipelines. They must have experience in application development with Python and designing distributed systems. · ...


  • Tokyo ByteDance

    The Search Operations team aims to improve search user experience. · ...


  • Tokyo Agoda ¥7,000,000 - ¥12,000,000 per year

    We are looking for a Technical Product Manager to lead the vision, roadmap, and delivery of Agoda's observability platforms. As a TPM, you will collaborate with engineers, SREs, and data scientists to strengthen our ability to detect, prevent, and resolve production issues faster ...


  • Tokyo Rakuten ¥2,000,000 - ¥2,800,000 per year

    We are looking for Entrepreneurial, Innovative, Growth-Oriented, and Customer-obsessed individuals to join our growing team to build the Telco of the Future. · Ensure high availability, resilience, and scalability across multi-region production environments through automation and ...


  • Tokyo G Talent

    The company has historically supported the operational efficiency of pharmacies by providing various solutionsThe challenges facing the Japanese healthcare system are complex, · making the power of technology indispensable. · ...


  • Tokyo ByteDance ¥1,500,000 - ¥2,500,000 per year

    The Search Operations team aims to improve search user experience. · ...


  • Tokyo Rakuten ¥6,000,000 - ¥9,000,000 per year

    We are looking for Entrepreneurial, Innovative, Growth-Oriented, and Customer-obsessed individuals to join our growing team to build the Telco of the Future. · This role contributes to the operational excellence of Rakuten's DevOps and Observability platforms. · Providing proacti ...


  • Tokyo, Japan Cybereason

    We are seeking an experienced Hands-On Rust Engineering Team Lead to lead a team of talented engineers while remaining deeply involved in architecture, design, and development. · Design, develop, and maintain scalable backend services and selected front-end components for our pla ...


  • Tokyo Rakuten ¥6,000,000 - ¥12,000,000 per year

    The Leisure Product Department (LPD) is handling a lineup of lifestyle and leisure related services, some of them being category leaders in the Japanese market. We aim at growing globally and becoming world leaders through innovation and technology. · We are looking for an experi ...


  • Tokyo Rakuten

    +We are looking for an experienced full-stack site reliability engineer who has a passion for working on complex/large systems and understands the importance of maintaining and supporting one. · +Design & Develop features on small to large scale systems · Handle operations like r ...


  • Tokyo Cybereason ¥18,000,000 - ¥21,600,000 per year

    Cybereason is on a mission to reverse the adversary's advantage by empowering defenders with ingenuity and technology to end cyber-attacks. · We have the technology, and now we are looking to expand our talent Join a market leader and a diverse team of passionate professionals wh ...


  • Tokyo, Tokyo G Talent

    Design construction and operation of infrastructure infrastructure AWS planning and implementation of measures to maximize development productivity improvement of system reliability and performance detection response to and prevention of service failures introduction development ...


  • Tokyo TikTok

    The Data Delivery & Operation team is at the heart of this mission, ensuring that every labeled dataset fuels our algorithms to understand intent, improve relevance, and enhance user discovery. · ...


  • Tokyo IBM

    The Resident Solutions Engineer (RSE) combines automation, infrastructure management, and reliability practices to ensure efficient software delivery and system stability. · ...