MLOps Engineer

6日前


Tokyo Rakuten Full time
Description

Rakuten is one of the world's leading e-commerce site operators, with a mission to empower people and society through the internet. We are striving to become a global innovation company while expanding various businesses.

The Machine learning and Deep learning Engineering Department (MDE) is a group of engineers and scientists who specialize in natural language processing (NLP), search, and recommendation systems. We conduct state-of-the-art research and apply cutting-edge technologies, such as transformer model, dense retrieval, distributed GPU training, and large-scale machine learning, to a variety of Rakuten products and services. We are looking for passionate experts in machine learning research and engineering to join us in our journey to define the next-generation e-commerce experience.

The GPU Engineering team is at the forefront of delivering a robust GPU infrastructure and cutting-edge ML platforms that powers the development and deployment of ML models across various teams of ML engineers and researchers within Rakuten. Use cases include semantic search, visual search, recommendation, LLMs, and more.

Position:

Why We Hire

As an MLOps Engineer in the GPU Engineering team, you will be at the heart of Rakuten's ML operations, focusing on the deployment, monitoring, and management of ML models. You'll work closely with ML Engineers across the department to provide a reliable infrastructure that supports rapid model development, training, and deployment. Your expertise will contribute to the efficiency and scalability of our ML projects, directly impacting Rakuten's product innovation and service excellence.

Position Details

Key Responsibilities:
  • Design, implement, and maintain ML pipelines for automated training, testing, and deployment of machine learning models, ensuring scalability and efficiency.
  • Work collaboratively with ML engineers to troubleshoot and optimize model performance, ensuring models are production-ready and meet defined SLAs.
  • Manage and monitor Kubernetes clusters and related infrastructure to support high-volume ML workloads, implementing best practices for security and resilience.
  • Develop and maintain documentation on ML infrastructure, tools, and best practices, providing guidance and support to ML teams.
  • Continuously evaluate and incorporate new technologies and tools to enhance the ML platform's capabilities and performance.

Mandatory Qualifications:
  • Experience: 1 year or more of experience in MLOps, with a proven track record of managing ML infrastructure and pipelines.
  • Education: Bachelor's or higher degree in Computer Science, Engineering, or a related technical discipline.
  • Kubernetes Proficiency: Deep understanding of Kubernetes (K8s) infrastructure and its application in managing ML workloads.
  • Programming Skills: Proficiency in Python and familiarity with ML frameworks (e.g., TensorFlow, PyTorch).
  • CI/CD Tools: Experience with CI/CD tools (e.g., GitHub Actions, Jenkins, GitLab CI) and container technologies (e.g., Docker).
  • Strong communication and teamwork skills.
  • Passion for technology and solving challenging problems.

Desired Qualifications:
  • Familiarity with CUDA
  • Experience training large models, including LLMs

Languages:
English (Overall - 4 - Fluent)
  • MLOps Engineer

    6日前


    Tokyo Rakuten Full time

    Description · : Business Overview · Rakuten is one of the world's leading e-commerce site operators, with a mission to empower people and society through the internet. We are striving to become a global innovation company while expanding various businesses. · Department Overvi ...


  • Tokyo PayPay Corporation

    PayPayについて · 2018年にサービスを開始してから約5年でユーザー数6300万人を突破したフィンテック企業であるPayPayは約50か国の国と地域から集まった多様なメンバーで構成されています。 · OUR VISION IS UNLIMITED_ · 我々は自分たちの想像を超える未来を創るためにあえて明確なビジョンは必要ないと考えています。常にDay1であるスタンスを忘れずに、誰もが想像できないようなビジョン(未来)を実現していくのがPayPayです。 · この壮大なビジョンに前向きに取り組み、他社に真似できない圧倒的なスピードでプロダク ...


  • Tokyo Rakuten Full time

    Description · : Business Overview The Technology Platforms Division (TPD) is responsible for building and operating the infrastructure and ecosystem platforms which power the Rakuten Group. Our mission is to provide our Rakuten Cloud and Ecosystem Platforms which will deliver C ...


  • Tokyo Amazon Japan G.K. Full time

    About the Role · We are seeking a senior data engineer to join our Data Services and Technologies team in Tokyo, Japan. The ideal candidate will be passionate about working with data and eager to make a broad business impact in the rapidly evolving e-commerce industry. · As a key ...

  • Software Engineer

    22時間前


    Tokyo Renesas Electronics Full time

    Job Title: MLOps Engineer · In the AI & Cloud Engineering (ACE) Division, you will be part of a team developing a comprehensive AI strategy to deliver a highly flexible platform for exploring new Deep Learning / Machine Learning model architectures. · This role involves designing ...


  • Tokyo Cogent Labs Full time

    About Cogent · Cogent Labs is a leading innovator in intelligent automation, dedicated to improving people's quality of work and life since 2014. · With a deep understanding of customers' needs and practices, we build products that leverage the power of custom AI models through c ...


  • Tokyo NVIDIA Full time

    NVIDIA seeks a Solutions Architect or Data Scientist to collaborate with global alliance partners on GPU Accelerated Computing solutions. We require an individual with expertise in Machine Learning and Deep Learning, specifically Generative AI. · We need a passionate, hard-workin ...


  • Tokyo NVIDIA Full time

    NVIDIA's Worldwide Field Operations (WWFO) team is looking for an experienced systems and network infrastructure Solutions Architect. Do you want to be part of a team that brings new Artificial Intelligence (AI) hardware and software technologies to cloud providers? We are lookin ...


  • Tokyo PayPay Corporation

    About PayPay · PayPay is a FinTech company that has grown to over 65M (as of August 2024) users since its launch in 2018. Our team is hugely diverse with members from over 50 different countries. · OUR VISION IS UNLIMITED_ · We dare to believe that we do not need a clear visi ...


  • Tokyo PayPay Corporation

    About PayPay · PayPay is a FinTech company that has grown to over 65M (as of August 2024) users since its launch in 2018. Our team is hugely diverse with members from over 50 different countries. · OUR VISION IS UNLIMITED_ · We dare to believe that we do not need a clear visi ...