About The Workshop

Training and deploying huge machine learning models, such as GPT, Llama, or large GNNs, require a vast amount of compute resources, power, storage, memory. The size of such models is growing exponentially, as is the training time and the resources required. The cost to train large foundation models has become prohibitive for everyone but very few large players. While the challenges are most visible in training, similar considerations apply to deploying and serving large foundation models for a large user base.

The proposed workshop aims to bring together AI/ML researchers, computer architects, and engineers working on a range of topics focused on training and serving large ML models. The workshop will provide a forum for presenting and exchanging new ideas and experiences in this area and to discuss and explore hardware/software techniques and tools to lower the significant barrier of entry in the computation requirements of AI foundation models.

Motivation

We are seeking innovative, evolutionary and revolutionary ideas around software and hardware architectures for training such challenging models and strive to present and discuss new approaches that may lead to alternative solutions.

Location

The workshop will be held in Tokyo, Japan.

The workshop will be co-located with ISCA 2025.

Date: 22 June 2025





Call for papers

The workshop will present original works in areas such as (but not limited to):

  • Workload Characterization
  • Inference Serving at Scale
  • Distributed Training
  • Novel Networking and Interconnect Approaches for Large AI/ML Workloads
  • Addressing Resilience of Large Training Runs
  • Data Reduction Techniques
  • Better Model Partitioning
  • Data Formats and Precision
  • Efficient Hardware and Competitive Accelerators

Scope of Papers

Authors can submit either 8-page full papers or up to 4-page short papers.

For the short paper, out-of-the box ideas and position papers are especially encouraged.

Important Deadlines

All deadlines are at 11:59 PM AoE (Anywhere on Earth).

Paper Submission:15 April 2025
Accept Notification: 10 May 2025
Workshop Date: 22 June 2025