Tokyo PaleBlueDot AI ¥1,500,000 - ¥2,500,000 per year

Location:
Remote, with willingness to travel overseas
Core Responsibilities

  • Strategic Design & Architecture Planning
Lead the end-to-end architecture design of overseas AI compute clusters, covering compute, network, storage, and liquid-cooling systems.
Deeply understand clients' AI workload requirements and translate them into advanced, reliable, and scalable technical solutions.

  • End-to-End Construction & Delivery Management

Take full responsibility for the entire lifecycle of overseas, multi-thousand-GPU AI cluster deployments — from planning, equipment procurement, acceptance testing, installation, cabling, and commissioning to final go-live.

Lead and continuously optimize cluster deployment processes to ensure delivery within strict timelines and budget constraints.
Coordinate with data center facility teams, hardware vendors, and liquid-cooling suppliers to ensure seamless integration across all stages.

  • Operations Management & Incident Response

Build and lead a high-performance, multicultural operations team overseas; establish a 24/7 operations framework, standard operating procedures (SOPs), and emergency response protocols.

Develop comprehensive monitoring, alerting, logging, and performance analysis platforms for full observability and health management of the clusters.
Serve as the senior escalation point for complex, high-impact technical issues; lead root cause analysis and drive systematic improvements.

  • Client & Technical Interface

Act as the technical authority interfacing directly with clients' engineering teams, delivering technical presentations, POC support, and deep-dive technical discussions.

Ensure cluster services meet or exceed client SLA expectations, enhancing customer satisfaction and long-term partnership.

  • Operations Efficiency & Cost Optimization
Continuously improve operational efficiency of clusters, focusing on key metrics such as PUE, WUE, and compute utilization.
Manage the operations department's budget; pursue cost optimization opportunities while maintaining service excellence.
Qualifications


Experience:

Bachelor's degree or above in Computer Science, Electrical Engineering, or a related field.
Minimum 10 years of experience in large-scale data center or HPC/AI cluster operations and management.
Overseas Project

Experience:

Proven track record in the successful delivery of advanced AI compute clusters or hyperscale data centers abroad.
Deep understanding of overseas project operational models, compliance requirements, and cultural differences.

Architecture Expertise:
Proficient in AI cluster architectures (e.g., NVIDIA DGX/SuperPOD, GPU-as-a-Service).
Strong understanding of InfiniBand/RoCE networking and distributed storage systems.

Liquid Cooling Technology:
Hands-on experience deploying or operating immersion or cold-plate liquid-cooled clusters.
Familiarity with their principles, operational challenges, and associated risks.

Systems Operations:


Expertise in Linux environments, cluster schedulers (e.g., Slurm, Kubernetes), monitoring tools (e.g., Prometheus, Grafana), and automation frameworks (e.g., Ansible, Python).


Leadership:
Minimum 5 years of management experience leading technical teams.
Ability to build, lead, and motivate high-performing engineering teams in multicultural environments.

Customer Orientation:
Excellent communication and presentation skills for effective technical discussions with internal and external stakeholders.

Language Proficiency:
Fluent English communication skills (both written and spoken) are preferred.
Show more Show less

  • Tokyo PaleBlueDot AI ¥1,500,000 - ¥2,500,000 per year

    Lead the end-to-end architecture design of overseas AI compute clusters, covering compute, network, storage, and liquid-cooling systems. Take full responsibility for the entire lifecycle of overseas, multi-thousand-GPU AI cluster deployments — from planning, equipment procurement ...


  • Tokyo IBM ¥4,000,000 - ¥12,000,000 per year

    IBMグループでキャリアを育くむ社員は、世界各国のお客様とIBMとのリレーションをさらに深め、更なる協業を推進していく役割を担います。 · あなたは、様々な専門性を持った有識者との協業を通じて、各業界を代表するお客様のハイブリッドクラウドとAIを活用した変革をご支援します。 · 大手自動車メーカー様Oracle EBS関連案件の開発/保守について、お客様である大手自動車メーカー様と開発・保守を担うIBMインドチームとのブリッジSEとして橋渡しを担う。 · PM/PMOとして同領域の各種案件を管理・推進する。 · 初任業務完了後は、同大手自動車メーカー様の ...


  • Tokyo IBM ¥5,400,000 - ¥10,800,000 per year

    Salesforceテクニカル・スペシャリストの役割は、IBMグループにおけるキャリアを育むための社員が世界各国のお客様とIBMとのリレーションをさらに深め、更なる協業を推進していくことを担います。様々な専門性を持った有識者との協業により、お客様に価値ある変革をもたらすクリエイティブなソリューションを作成します。 · ...


  • Tokyo Capgemini Full time¥6,000,000 - ¥12,000,000 per year

    ソフトウェアエンジニアとして、DXに関連する数億円規模のシステム開発プロジェクトで要件定義、設計、開発などの業務に携わる。プロジェクトは大手企業からのプライム案件がメインとなり、全行程を外部に発注せず一気通貫で内部完結している。クラウドネイティブアーキテクチャ、コンテナ、マイクロサービスなど先端技術に注力しているため、様々な領域にチャレンジできる。国内プロジェクト、グローバルプロジェクト、いずれも参画の可能性がある。 · ソフトウェアエンジニアとして、DXに関連する数億円規模のシステム開発プロジェクトで要件定義、設計、開発などの業務に携わる。 · プロジ ...


  • Tokyo Capgemini Full time¥7,500,000 - ¥15,000,000

    クラウドスペシャリストとしてクラウドの各種マネージドサービスのアーキテクチャリングや提案活動の支援、設計構築、新たなサービスの技術検証をおこなっていただきます。 · クラウドネイティブアーキテクチャの検討および提案活動支援 · クラウドネイティブアーキテクチャの設計・構築 · クラウドを利用しているプロジェクトの技術支援 · オンプレミス、AWS、Azure、GCPいずれかでのインフラ設計・構築経験3年以上 · オンプレミスからAWS、Azure、GCPへの移行経験 · 論理的思考、コミュニケーション力 · ネイティブレベルの日本語力 · フレックスタイ ...


  • Japan EY Studio+ Nederland ¥3,600,000 - ¥10,800,000 per year

    サポートエンジニアとしてEY Japanで働くことができます。IT基盤の運用、プロダクションサポート、品質向上などを責任を持って実行することができます。 · ...


  • Tokyo CBRE Japan ¥6,000,000 - ¥12,000,000 per year

    The purpose of this position is to oversee the IFM delivery such as manage multiple functions of building operations and maintenance for a facility, campus, or portfolio of buildings of significant complexity, across a portfolio of client's sites in APAC (Japan and South Korea, t ...


  • Tokyo GlobalStyle/株式会社 グローバルスタイル ¥4,000,000 - ¥8,000,000 per year

    We are looking for a skilled OpenSearch Engineer to design, deploy, and manage our high-performance, distributed search and analytics platform. · Design, implement, and maintain large-scale OpenSearch/Elasticsearch clusters for various use cases. · Develop, optimize, and tune com ...


  • Tokyo CBRE Asia Pacific $80,000 - $120,000 per year

    The purpose of this position is to oversee the IFM delivery such as manage multiple functions of building operations and maintenance for a facility, campus, or portfolio of buildings of significant complexity, across a portfolio of client's sites in APAC (Japan and South Korea, t ...


  • Tokyo Treasure Data ¥104,000 - ¥250,000 per year

    We are thrilled that Forrester has recognized Treasure Data as a Leader in · The Forrester Wave: · Customer Data Platforms For B2C.It's an honor to be acknowledged for our efforts in advancing the CDP industry with cutting-edge AI and real-time capabilities. · ...


  • Tokyo CBRE Japan ¥3,500,000 - ¥6,000,000 per year

    The purpose of this position is to oversee the IFM services delivery of building operations and maintenance for a facility, campus, or buildings, across a portfolio of client's sites in Japan, with cross supporting & management of sites in other location / countries within the co ...


  • Tokyo TEKsystems

    Join one of the most influential insurance companies in Japan, known for its robust infrastructure and cutting-edge technology. With a global footprint and significant investments in digital transformation, this organization manages a massive infrastructure, leveraging multicloud ...

  • MLOps Engineer

    1ヶ月前


    Tokyo AI Robot Association Full time¥4,000,000 - ¥12,000,000 per year

    We are launching a groundbreaking initiative: collecting one million hours of humanoid robot operation data with hundreds of robots, and leveraging it to train the world's most powerful Vision-Language-Action (VLA) models. · Design, implement, and maintain large-scale ML pipeline ...


  • Tokyo NVIDIA ¥120,000 - ¥180,000 per year

    NVIDIA is looking for Senior Cloud Infrastructure/DevOps Solutions Architect to join its NVIDIA Infrastructure Specialist Team. · Maintain large scale HPC/AI clusters with monitoring, logging and alerting · Manage Linux job/workload schedulers and orchestration tools · Develop an ...


  • Tokyo Inter-American Development Bank ¥600,000 - ¥1,200,000 per year

    We are seeking a senior professional with experience in knowledge management, developmental and content editing, teaching and communications for evaluation in development. The IDB Group is a community of diverse, versatile, and passionate people who come together on a journey to ...


  • Tokyo TEKsystems ¥1,750,000 - ¥2,500,000 per year

    This role plays a critical role in transforming transportation—designing scalable, · cloud-native infrastructure that powers AI and machine learning across global · platforms. · ...


  • Tokyo Relocate ¥8,000,000 - ¥15,000,000 per year

    At the core of our architecture in PayPay, we use Kafka for high-performance data streaming, and every payment that goes through the app is handled by multiple topics in Kafka. To keep up with the challenges of our growing product and future expansion, we are expanding our Stream ...


  • Tokyo The Peninsula Hotels ¥1,800,000 - ¥2,500,000 per year

    The Assistant Director of Engineering is responsible for overseeing the department, ensuring the smooth operation and maintenance of all mechanical and technical systems. · Excellence technical knowledge in mechanical, electrical and civil works. · Prior experiences in building m ...


  • Tokyo AI Robot Association Freelance¥10,000,000 - ¥20,000,000 per year

    As an Infrastructure Specialist supporting AI × robot foundation model development at AIRoA, you will be responsible for the design, construction, · and operation of our infrastructure (cloud platforms, on-premise environments, · etc.).To provide GPU clusters, · large-scale data ...


  • Tokyo NVIDIA ¥1,200,000 - ¥2,400,000 per year

    NVIDIA is a world leader in computer graphics, artificial intelligence, and accelerated computing. We are looking for a Solution Architect to work with enterprise companies in Japan and partners, promoting the adoption and providing technical support to enable them to use our por ...

  • Partner Ops Analyst

    2ヶ月前


    New York Dataiku

    We're seeking a highly analytical and results-driven Senior Partner Operations Analyst to join our Revenue Operations team in Singapore. This role will primarily support the Global Partner Organization (Partner Operations) team by providing insights, operational support, and stra ...