Treasure Data:
At Treasure Data, we're on a mission to radically simplify how companies use data and AI to create connected customer experiences. Our intelligent customer data platform (CDP) drives revenue growth and operational efficiency across the enterprise to deliver powerful business outcomes.
We are thrilled that Forrester has recognized Treasure Data as a Leader in The Forrester Wave: Customer Data Platforms For B2C. It's an honor to be acknowledged for our efforts in advancing the CDP industry with cutting-edge AI and real-time capabilities.
Treasure Data employees are enthusiastic, data-driven, and customer-obsessed. We are a team of drivers: self-starters who take initiative, anticipate needs, and proactively jump in to solve problems. Our actions reflect our values of honesty, reliability, openness, and humility.
Your Role:
Your role will be to oversee our Japan-based Site Reliability Engineering team. Our SREs own our compute platform (AWS, Kubernetes, EC2, Lambda, ECS), our common tooling, and our overall site availability. They work directly with development teams to solve product challenges and provide education around best practices. As our SRE leader in Japan, you'll work closely with your North-America-based counterparts to design and implement solutions to high-scale challenges.
Managers at Treasure Data prioritize solving people and communication challenges before technical problems, but remain active technical contributors. They are eager to build effective, dynamic teams that iteratively and rapidly deliver resilient systems. The role requires working across product and engineering teams on complex problems whose solutions demand in-depth analysis, evaluation of multiple competing factors, and identification of the best trade-offs for successful delivery.
This role requires leadership by example and will have you making regular individual contributions. You and the team will be directly responsible for platform solutions in these critical areas: availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. Additionally, as a leader within the engineering organization, you'll take part in broader planning and align your team with its outcomes.
Success in this role requires a passion for helping others and improving their lives. You do this by working with people to make team collaboration more effective and by helping them simplify complex systems so that they are understandable and operable. You communicate decisions, ideas, designs, and the operation of systems and services clearly and concisely and, more importantly, you derive a lot of satisfaction from teaching and enabling others to do the same.
Responsibilities & Duties:
Managing a team of 5-8 Site Reliability Engineers by setting clear expectations and providing continuous feedback.
Providing ongoing career coaching on both technical and non-technical areas of improvement.
Working with Engineering and Product stakeholders to organize and execute on large projects.
Planning and facilitating agile sprints and holding the team accountable to sprint deliverables.
Improving processes by introducing metrics, experimenting with improvements, and implementing new ways of working.
Assisting with incident coordination as part of our on-call rotation.
Assisting with system design activities to make the right trade-offs between reliability and delivery speed, and communicating those decisions clearly.
Required Qualifications:
Proven experience as a people manager for a technical team, including coaching, performance management, and delivering difficult feedback when necessary.
Experience managing or supporting a distributed SRE or infrastructure team across multiple time zones.
Hands-on experience with at least one major cloud provider such as AWS, Azure, or GCP.
Working familiarity with infrastructure-as-code tools, including Terraform, CloudFormation, CDK, or Ansible.
Working knowledge of at least one programming language, such as Python, Java, Ruby, or JavaScript.
Experience leading or participating in production incident response, including incident command and post-incident review.
Demonstrated ability to lead complex, cross-team software or platform initiatives from planning through delivery.
Working knowledge of agile software development practices and backlog-driven delivery.
Understanding of cloud governance fundamentals, including cost management, patching, and secure system design.
Strong communication and leadership skills, with the ability to represent reliability concerns to engineering and senior leadership.
Language Requirements:
The official language for written and verbal communication for this position is English, but Japanese fluency is strongly preferred.
Physical Requirements:
Hybrid: 3 days per week in the Tokyo office.
Travel Requirements:
At least once a year for the team onsite.
About Treasure Data:
Treasure Data is the Intelligent Customer Data Platform (CDP) built for enterprise scale and powered by AI. Recognized as a Leader by Forrester and IDC, Treasure Data empowers the world's largest and most innovative companies to deliver hyper-personalized customer experiences at scale that increase revenue, reduce costs, and build trust.
Through unique capabilities such as the Diamond Record, AI Agent Foundry, and AI Decisioning with Real-Time Personalization, Treasure Data enables marketing and CX teams to personalize cross-channel engagement in real-time, optimize marketing spend while increasing ROI, and drive customer lifetime value through more intelligent retention and loyalty.
Our Dedication to You:
We value and promote diversity, equity, inclusion, and belonging in all aspects of our business and at all levels. Success comes from acknowledging, welcoming, and incorporating diverse perspectives.
Diverse representation alone is not the desired outcome. We also strive to create an inclusive culture that encourages growth, ownership of your role, and achieving innovation in new and unique ways. Your voice will be heard, and we will help amplify it.
Agencies and Recruiters:
We cannot consider your candidate(s) without a contract in place. Any resumes received without an active agreement will be considered gratis referrals to us. Thank you for your understanding and cooperation.