Description
Join the Chaos Engineering team in Amazon Search. We perform experiments in production to harden Search against outages and make sure that whenever a customer searches for products, they find what they are looking for.
In This Role You Will
- Design, implement, execute, and automate chaos experiments to continuously test Amazon Search' resilience against hardware failures, dependency outages, traffic spikes and more.
- Collaborate with service owners to remedy vulnerabilities, minimize blast radius and harden Amazon Search.
- Research tools and practices in resilience engineering and adopt them as appropriate.
Joining this team, you'll experience the benefits of working in an entrepreneurial environment, while leveraging the resources of (AMZN), one of the world's leading internet companies. We are a diverse, customer-obsessed and passionate team located in Meguro, Tokyo.
Key job responsibilities
- Develop and maintain our chaos experiment orchestrator
- Design, execute, automate, and maintain chaos experiments
- Develop and maintain our distributed load generator
- Develop and maintain our petabyte-scale log archival and query service
- Join a 12/12 on-call rotation for incident response and mitigation
Basic Qualifications
- Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, Rust
Preferred Qualifications
- Experience with Linux/Unix
- Experience in networking, storage systems, operating systems and hands-on systems engineering
- Experience with distributed operational health and performance monitoring systems
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
Company
- Amazon Japan G.K.
Job ID: A3154855
-
Tokyo AmazonJoin the Chaos Engineering team in Amazon Search to perform experiments in production to harden Search against outages. · Design implement execute automate chaos experiments to continuously test Amazon Search' resilience against hardware failures dependency outages traffic spikes ...
-
Tokyo Amazon Full timeWe perform experiments in production to harden Search against outages and make sure that whenever a customer searches for products, they find what they are looking for. · ...
-
Tokyo AmazonJoin the Chaos Engineering team in Amazon Search. · Design, implement, execute, and automate chaos experiments to continuously test Amazon Search' resilience against hardware failures... · Collaborate with service owners to remedy vulnerabilities... · ...
-
Cloud Data Engineer
2週間前
Greater Tokyo Area Randstad JapanRandstad is partnered with a leading Life Insurance firm in their search for an experienced Cloud Data Engineer / SRE with specialization in Data projects. · ...
-
Greater Tokyo Area Randstad JapanRandstad is partnered with a leading Life Insurance firm in their search for an experienced Cloud SRE with specialization in Data projects. · ...
-
Storage Engineer
2ヶ月前
Tokyo SMALL WORLD / Work in Japan? ¥3,500,000 - ¥6,500,000 per yearA Storage Engineer is responsible for designing implementing maintaining and optimizing storage infrastructure ensuring high availability performance security supporting critical business applications data retention strategies. · ...
-
Tokyo Treasure DataOversee our Japan-based Site Reliability Engineering team to ensure availability latency performance efficiency change management monitoring emergency response and capacity planning. · Manage a team of 5-8 Site Reliability Engineers by setting clear expectations and providing con ...
-
Greater Tokyo Area Randstad JapanThe candidate will be responsible for architecting, developing and deploying solutions to automate data pipelines. They must have experience in application development with Python and designing distributed systems. · ...
-
Tokyo ByteDanceThe Search Operations team aims to improve search user experience. · ...
-
Tokyo Agoda ¥7,000,000 - ¥12,000,000 per yearWe are looking for a Technical Product Manager to lead the vision, roadmap, and delivery of Agoda's observability platforms. As a TPM, you will collaborate with engineers, SREs, and data scientists to strengthen our ability to detect, prevent, and resolve production issues faster ...
-
Tokyo Rakuten ¥2,000,000 - ¥2,800,000 per yearWe are looking for Entrepreneurial, Innovative, Growth-Oriented, and Customer-obsessed individuals to join our growing team to build the Telco of the Future. · Ensure high availability, resilience, and scalability across multi-region production environments through automation and ...
-
Tokyo G TalentThe company has historically supported the operational efficiency of pharmacies by providing various solutionsThe challenges facing the Japanese healthcare system are complex, · making the power of technology indispensable. · ...
-
Tokyo ByteDance ¥1,500,000 - ¥2,500,000 per yearThe Search Operations team aims to improve search user experience. · ...
-
Tokyo Rakuten ¥6,000,000 - ¥9,000,000 per yearWe are looking for Entrepreneurial, Innovative, Growth-Oriented, and Customer-obsessed individuals to join our growing team to build the Telco of the Future. · This role contributes to the operational excellence of Rakuten's DevOps and Observability platforms. · Providing proacti ...
-
Tokyo, Japan CybereasonWe are seeking an experienced Hands-On Rust Engineering Team Lead to lead a team of talented engineers while remaining deeply involved in architecture, design, and development. · Design, develop, and maintain scalable backend services and selected front-end components for our pla ...
-
Tokyo Rakuten ¥6,000,000 - ¥12,000,000 per yearThe Leisure Product Department (LPD) is handling a lineup of lifestyle and leisure related services, some of them being category leaders in the Japanese market. We aim at growing globally and becoming world leaders through innovation and technology. · We are looking for an experi ...
-
Tokyo Rakuten+We are looking for an experienced full-stack site reliability engineer who has a passion for working on complex/large systems and understands the importance of maintaining and supporting one. · +Design & Develop features on small to large scale systems · Handle operations like r ...
-
Tokyo Cybereason ¥18,000,000 - ¥21,600,000 per yearCybereason is on a mission to reverse the adversary's advantage by empowering defenders with ingenuity and technology to end cyber-attacks. · We have the technology, and now we are looking to expand our talent Join a market leader and a diverse team of passionate professionals wh ...
-
Tokyo, Tokyo G TalentDesign construction and operation of infrastructure infrastructure AWS planning and implementation of measures to maximize development productivity improvement of system reliability and performance detection response to and prevention of service failures introduction development ...
-
Tokyo TikTokThe Data Delivery & Operation team is at the heart of this mission, ensuring that every labeled dataset fuels our algorithms to understand intent, improve relevance, and enhance user discovery. · ...
-
Tokyo IBMThe Resident Solutions Engineer (RSE) combines automation, infrastructure management, and reliability practices to ensure efficient software delivery and system stability. · ...