Site Reliability Engineer jobs in Tokyo

Site Reliability Engineer

3 weeks ago

Tokyo Relocate

Site Reliability Engineer (Bilingual SRE)

Tokyo, Japan

PayPay

Advanced relocation package

Adaptation tips

Flight ticket

Language courses

Money for moving expenses

Temporary housing

Housing search assistance

Visa services

About PayPay

PayPay, a FinTech company that has reached more than 70M registered users (as of July 2025) since its launch in 2018, is rapidly expanding its business as a payment platform used by approximately one out of every two smartphone users in Japan.

The company has a diverse team of professionals from more than 50 countries and is building a world-class engineering organization.

Our biggest competitor is "cash".

We are looking for people who embrace this challenge, who can refine the product at a speed other companies cannot match, and who bring both professionalism and a passion for promoting and spreading this financial life platform in a short time.

PayPay is committed to maintaining a "Day 1" mindset, bringing to life visions of the future that no one else can even imagine.

At PayPay, we provide an environment where everyone can deliver outstanding performance as professionals, through a flexible work style that includes remote and office.

Position

At PayPay, we're constantly working on improving our systems and processes to support PayPay's exponential growth.

As an SRE at PayPay, you will work to ensure high availability and top-level performance so that our users get a flawless, reliable service that exceeds expectations.

Considering PayPay's growth, we are looking for experienced SREs who can deliver insights into system bottlenecks and ensure system reliability and scalability, while increasing the number of services that our company offers.

We are looking for individuals who can bring informed and unique viewpoints, enjoy collaborating with a cross-functional team and are actively pushing boundaries to develop reliable and scalable solutions and positive user experiences.

Key Responsibilities

Analyze current technologies used in the company and develop monitoring and notification tools to improve observability and visibility.
Ensure system stability by pre-emptively verifying failure scenarios and implementing solutions to reduce MTTR.
Develop solutions to improve system performance with a focus on high availability, scalability and resilience.
Integrate telemetry and alerting platforms to track and improve the reliability of systems.
Implement industry best practices for system development, configuration management and system deployment.
Ensure a seamless flow of information between teams by documenting knowledge gained.
Stay up to date on modern technologies and trends, and advocate for adopting them in products where they add value.

Participate in incident management including troubleshooting production issues, driving root cause analysis (RCA) and actively sharing lessons learned to improve system reliability and internal knowledge.

Your qualifications

Experience troubleshooting and tuning high-performance microservice architectures running on Kubernetes and AWS in highly available production environments.

5+ years of experience in software development in Python, Java, Go, etc., with strong fundamentals in data structures, algorithms, problem solving and complexity analysis.

During the selection process, you will have a coding challenge.
Curious and proactive about finding and addressing performance bottlenecks and scalability and resilience problems.
Experience with observability tools and gathering data.
Database knowledge such as RDS, NoSQL, distributed TiDB, etc.
Excellent communication skills and a collaborative, get-things-done attitude.
Enjoy taking up a challenge and driving it to conclusion.
Ability to verbally communicate in both English and Japanese.

Preferred Qualifications

Container image management and optimization.
Experience in large distributed system architecture and capacity planning.
Understanding of IaC and automation tools such as Terraform, CloudFormation, etc.
Background in SRE/DevOps concepts and implementation.
Experience in managing monitoring tools like CloudWatch, VictoriaMetrics, Prometheus and reporting with Snowflake and Sigma.
In-depth knowledge of web technologies such as CloudFront, Nginx, etc.

Experience in designing, implementing or maintaining disaster recovery strategies and multi-region architecture to ensure high availability, resilience, and business continuity across critical systems.

Business proficiency level in both English and Japanese.

What we offer

Social Insurance (health insurance, employee pension, employment insurance and compensation insurance)
401K
Translation/Interpretation support
Visa sponsorship + relocation support

Additional details

PayPay 5 senses

Please refer to the PayPay 5 senses to learn what we value at work.

Working Conditions

Employment Status

Full Time

Office Location

Hybrid Workstyle (flexible working style including Remote and office)
There are no fixed rules regarding office attendance in Product group; it depends on each individual's discretion.
LIFE in JAPAN FACTBOOK

Work Hours

Super Flex Time (No Core Time)

In principle, 9:00am-5:45pm (actual working hours: 7h45m + 1h break)

Holidays

Every Sat/Sun/National holidays (In Japan)/New Year's break/Company-designated Special days

Paid leave

Annual leave (up to 14 days in the first year, granted proportionally according to the month of employment. Can be used from the date of hire)
Personal leave (5 days each year, granted proportionally according to the month of employment)

PayPay's own special paid leave system, which can be used to attend to illnesses, injuries, hospital visits, etc., of the employee, family members, pets, etc.

Salary

Annual salary paid in 12 installments (monthly)
Based on skills, experience, and abilities
Reviewed once a year
Special Incentive once a year *Based on company performance and individual contribution and evaluation
Late overtime allowance
Payroll payment can be changed to digital salary payment "PayPay Paycheck" for an amount set by you

Java Python DevOps AWS Amazon Go Golang Kubernetes NoSQL SRE Prometheus RDS Site Reliability Site Reliability Engineer IaC CloudWatch Amazon Web Services

Site Reliability Engineer

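2 months ago

Tokyo TG Japan Inc. ￥15,000,000 - ￥20,000,000 per year

Design and build tools that support automation, operations management, and reliability improvement for the target systems · Build and support the operation of release pipelines for the target systems · As a member of the development/delivery team, embed SRE practices into solution design · Manage the full system lifecycle, from design and implementation through decommissioning · ...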
Site Reliability Engineer

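2 months ago

Tokyo CLPS Global ￥7,680,000 - ￥11,520,000 per year

You will be responsible for building and operating DevOps environments on system development and operations projects. You will handle technical coordination with clients in Japan and prepare documentation. · ...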
Site Reliability Engineer

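2 weeks ago

Tokyo 株式会社パワーエックス

The SRE/DevOps team develops and operates systems that deliver the critical foundations of PowerX's services at high quality and help the business move faster and smarter · The challenge of achieving high reliability for new services built on storage batteries · An environment where you work alongside excellent software engineers · Freedom to make your own design and technology choices · ...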
Site Reliability Engineer

2 months ago

Tokyo TG Japan Inc. ￥6,000,000 - ￥12,000,000 per year

A major European consulting firm is hiring an SRE (Site Reliability Engineer). · ...
Network Site Reliability Engineer

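1 month ago

Tokyo PlayStation ￥3,600,000 - ￥12,000,000 per year

This is the engineering division responsible for planning, designing, developing, and operating PlayStation Network. It provides consumers and game developers with a broad range of offerings that make up the PlayStation lifecycle, from client software to platform services such as game content distribution and sales, online gaming, and social community features. · As a Site Reliability Engineer and a member of the server-side application development team, you will be responsible for ensuring the reliability, performance, efficiency, and security of services. · ...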
Site Reliability Engineer

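1 week ago

Greater Tokyo Area BLOOMTECH, Inc

A stable business whose customers include more than 70% of the top 100 companies by market capitalization; flexible working through hybrid work and flextime; the appeal of growing the infrastructure for a new product from scratch. · Group management at large companies competing in global markets is becoming ever more difficult because of M&A and overseas expansion. · Beyond simple maintenance and operations, you will be involved in a wide range of phases, from service design and development to long-term refinement. · ...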
SRE (Site Reliability Engineer)

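3 days ago

Nihonbashi-Honcho, Chuo-ku, Tokyo, Thinkings株式会社 Remote job ￥4,200,000 per year

Job summary · Automating and streamlining infrastructure build and operations, and building and improving monitoring and observability platforms to prevent incidents and minimize their impact · Designing, building, and operating the infrastructure and CI/CD platforms underlying multiple products, including Sonar ATS · Improving the performance and scalability of each product · 3+ years of experience as an SRE or infrastructure engineer · ...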
Speeda - SRE (Site Reliability Engineer)

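5 days ago

Marunouchi, Chiyoda-ku, Tokyo, 株式会社ユーザベース

We are looking for engineers to build and operate the hybrid cloud that supports our in-house product Speeda and to improve its performance, reliability, and scalability. · Building a hybrid cloud using on-premises infrastructure, GCP, and AWS · Developing and operating microservices together with the development teams · Reducing toil · Operating Docker, Kubernetes, and Istio · ...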
Site Reliability Engineer

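2 weeks ago

Tokyo PowerX, Inc.

The SRE/DevOps team, which develops and operates systems that deliver PowerX's services at high quality and help the business move faster and smarter, is looking for excellent software engineers. · ...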
Senior Site Reliability Engineer /215918

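5 days ago

Higashi-Shimbashi, Minato-ku, Tokyo, 株式会社UPSTART Remote job ￥10,000,000 - ￥18,000,000 per year

Working with product managers who have deep expertise in cloud infrastructure and data analytics platforms, and with the leaders of the dotData product development team, you will clarify requirements and specifications such as availability, reliability, and security; evolve the system architecture incrementally; automate and streamline operations using the latest technologies; and drive continuous operational improvement so that services used by many customers are released continuously at stable quality. Over the medium to long term, the role can grow into leading the team organizationally as an engineering manager or, as a staff engineer, technically ...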
SRE(Site Reliability Engineer)

3 weeks ago

Ginza-itchome Station, Chuo-ku, Tokyo, 株式会社テックドクター Remote job

...
1103_Site Reliability Engineer (SRE)

2 weeks ago

Tokyo TIER IV ￥5,800,000 - ￥16,500,000

...
SRE(Site Reliability Engineer)

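3 weeks ago

Nishi-Gotanda, Shinagawa-ku, Tokyo, 株式会社ロジレス

Under our mission to "transform e-commerce logistics and scale Japan's future", we are taking on an e-commerce market worth roughly 15 trillion yen and growing at 3.7%. We aim to solve serious social problems such as labor shortages and rising logistics costs, and to raise productivity for both e-commerce businesses and warehouse operators. · Build and operate infrastructure on AWS. · Check how systems are behaving through monitoring, log analysis, and similar means. · Also handle performance optimization and bottleneck removal. · ...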
1103_Site Reliability Engineer (SRE)

1 month ago

Tokyo TIER IV

Job summary · ... Autoware-equipped self-driving vehicles around the world to ensure safety and reliability. ...
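[Reporting directly to the VPoT] SRE (Site Reliability Engineer)

1 month ago

Tokyo OLTA株式会社 ￥7,500,000 - ￥12,000,000

Infrastructure design, development, and operations; minimizing service downtime; improving system performance and scalability; protecting customer data and improving security quality; IaC, provisioning, monitoring, automation, and efficiency; CI/CD environments and developer experience · ...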
Tech_SRE (Site Reliability Engineer / Contract)

1 month ago

Toranomon, Minato-ku, Tokyo, 株式会社TERASS ￥2,000,000 - ￥2,800,000 per year

...
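SRE (Site Reliability Engineer), Contract

17 hours ago

Tokyo Tailor

Making the hard parts of building products easy, so that anyone can become a product maker: this is the world Tailor wants to create. We aim for a world where anyone can easily turn their ideas into reality, where the boundary between business and engineering is removed, and where diverse expertise and technologies can be brought together. · Automating deployment and server provisioning, and developing the tools for it · Monitoring and performance tuning of applications, middleware, and cloud services · Incident detection, capacity planning, and more · ...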
Site Reliability Engineer(SRE)/In-house

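2 months ago

Greater Tokyo Area BLOOMTECH, Inc ￥1,000,000 - ￥12,000,000 per year

Hybrid and flextime work at a real-estate tech company with strong business results; a key position for improving the trust customers place in the service. · Annual salary range: ￥6,000,000 - ￥12,000,000 · ...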
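[Contract] Site Reliability Engineer (SRE)

5 days ago

Nishi-Gotanda, Shinagawa-ku, Tokyo, 株式会社エライク Remote job

Job description: You will own the SRE domain that underpins infrastructure, reliability, and availability for trifa (トリファ), an eSIM app for overseas travel. Because the SRE team is in its start-up phase, you will drive operational improvement, automation, and platform groundwork hands-on. Main duties: infrastructure design and operations on GCP / AWS; improving and operating CI/CD pipelines; building out monitoring and logging platforms · Cloud infrastructure operations experience (3+ years); hands-on IaC experience; CI/CD and incident response experience; experience designing for availability and security · ...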
Customer Reliability Engineer

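1 month ago

Tokyo LINEヤフー株式会社 ￥7,000,000 - ￥10,000,000

Position overview · For LINE, you will apply deep domain knowledge and technical skill to the issues faced by internal and external customers of the Messaging Platform and Developer Product Platform, resolving problems and developing support tools in collaboration with the customer support (CS) and development teams. · ...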
SRE(Site Reliability Engineer)

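4 weeks ago

Shin-Yokohama, Kohoku-ku, Yokohama, Kanagawa, NE株式会社 ￥6,000,000 - ￥8,000,000 per year

About NE: NE株式会社 is a software company that operates Next Engine (ネクストエンジン), a unified e-commerce management SaaS with the top share in the e-commerce market. It currently supports the growth of more than 6,500 e-commerce businesses and listed on the Tokyo Stock Exchange Growth Market in November 2025. Next Engine ...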
