SlideShare a Scribd company logo
Three Developments in AI Infra
Robert Nishihara
Co-creator of Ray, Co-founder of Anyscale
My Background
2013: Began PhD in AI
2016: Started Ray along with Philipp Moritz and Ion Stoica
2019: Started Anyscale to commercialize Ray
AI/ML Infra Meetup | RAYvolution - The Last Mile: Mastering AI Deployment with Ray
(π)
Physical Intelligence
01.AI
AI/ML Infra Meetup | RAYvolution - The Last Mile: Mastering AI Deployment with Ray
Three Industry Trends
How the world has changed since we started Ray
Three Industry Trends
Three Industry Trends
Scaling Laws
● Larger models, more compute, more data → Better results
● Top priority for many AI infra teams: “enable training on 100x more data”
● ML platform / infra teams are on the critical path
● Scaling data -> scaling compute
Three Industry Trends
Scaling Laws
● Larger models, more compute, more data → Better results
● Top priority for many AI infra teams: “enable training on 100x more data”
● ML platform / infra teams are on the critical path
● Scaling data -> scaling compute
Question: How can I use more compute to get
higher quality data?
Three Industry Trends
Scaling Laws
● Larger models, more compute, more data → Better results
● Top priority for many AI infra teams: “enable training on 100x more data”
● ML platform / infra teams are on the critical path
● Scaling data -> scaling compute
AI Data Processing
● Data processing: SQL centric → AI centric
● Multimodal data (text, images, video, audio, …)
● New requirements: data intensive and GPU intensive
Three Industry Trends
Scaling Laws
● Larger models, more compute, more data → Better results
● Top priority for many AI infra teams: “enable training on 100x more data”
● ML platform / infra teams are on the critical path
● Scaling data -> scaling compute
AI Data Processing
● Data processing: SQL centric → AI centric
● Multimodal data (text, images, video, audio, …)
● New requirements: data intensive and GPU intensive
Question: How can I use AI to get more insights from my data?
Three Industry Trends
Scaling Laws
● Larger models, more compute, more data → Better results
● Top priority for many AI infra teams: “enable training on 100x more data”
● ML platform / infra teams are on the critical path
● Scaling data -> scaling compute
AI Data Processing
● Data processing: SQL centric → AI centric
● Multimodal data (text, images, video, audio, …)
● New requirements: data intensive and GPU intensive
Heterogeneity
● Multimodality, hardware accelerators, frameworks, clouds, models
Three Industry Trends
Scaling Laws
● Larger models, more compute, more data → Better results
● Top priority for many AI infra teams: “enable training on 100x more data”
● ML platform / infra teams are on the critical path
● Scaling data -> scaling compute
AI Data Processing
● Data processing: SQL centric → AI centric
● Multimodal data (text, images, video, audio, …)
● New requirements: data intensive and GPU intensive
Heterogeneity
● Multimodality, hardware accelerators, frameworks, clouds, models
AI/ML Infra Meetup | RAYvolution - The Last Mile: Mastering AI Deployment with Ray

More Related Content

PPTX
First Steps on Big Data
PDF
Hadoop and SAP BI
PPTX
data analytics lecture2.pptx
PPT
Information Security Analytics
PDF
Big data Introduction by Mohan
PDF
Problem Definition muAoPS | Analytics Problem Solving | Mu Sigma
PDF
Driving Customer Loyalty with Azure Machine Learning
 
PPT
SurveyAnalytics MaxDiff Webinar
First Steps on Big Data
Hadoop and SAP BI
data analytics lecture2.pptx
Information Security Analytics
Big data Introduction by Mohan
Problem Definition muAoPS | Analytics Problem Solving | Mu Sigma
Driving Customer Loyalty with Azure Machine Learning
 
SurveyAnalytics MaxDiff Webinar

Similar to AI/ML Infra Meetup | RAYvolution - The Last Mile: Mastering AI Deployment with Ray (20)

PPT
SurveyAnalytics MaxDiff Webinar Slides
PPTX
Big Data Developer Career Path: Job & Interview Preparation
PDF
Big Data - A Real Life Revolution
PDF
Cloud as a Data Platform
PDF
Big Data LDN 2018: THE PATH TO ENTERPRISE AI: TALES FROM THE FIELD
PDF
R for SAS Users Complement or Replace Two Strategies
PDF
Big Data Challenges
PDF
Advanced Analytics: Going From Big Data to Big Answers
PPTX
Why Everything You Know About bigdata Is A Lie
PDF
Gse uk-cedrinemadera-2018-shared
PPTX
Big data analytics presented at meetup big data for decision makers
PDF
What Managers Need to Know about Data Science
PDF
A6 big data_in_the_cloud
PDF
Agile data science
PDF
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
PPTX
Unit-I_Big data life cycle.pptx, sources of Big Data
PPT
Maxdiff webinar_10_19_10
PDF
02 a holistic approach to big data
PDF
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
PPTX
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
SurveyAnalytics MaxDiff Webinar Slides
Big Data Developer Career Path: Job & Interview Preparation
Big Data - A Real Life Revolution
Cloud as a Data Platform
Big Data LDN 2018: THE PATH TO ENTERPRISE AI: TALES FROM THE FIELD
R for SAS Users Complement or Replace Two Strategies
Big Data Challenges
Advanced Analytics: Going From Big Data to Big Answers
Why Everything You Know About bigdata Is A Lie
Gse uk-cedrinemadera-2018-shared
Big data analytics presented at meetup big data for decision makers
What Managers Need to Know about Data Science
A6 big data_in_the_cloud
Agile data science
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Unit-I_Big data life cycle.pptx, sources of Big Data
Maxdiff webinar_10_19_10
02 a holistic approach to big data
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
Big Data Matching - How to Find Two Similar Needles in a Really Big Haystack
Ad

More from Alluxio, Inc. (20)

PDF
Introduction to Apache Iceberg™ & Tableflow
PDF
Optimizing Tiered Storage for Low-Latency Real-Time Analytics at AI Scale
PDF
Meet in the Middle: Solving the Low-Latency Challenge for Agentic AI
PDF
From Data Preparation to Inference: How Alluxio Speeds Up AI
PDF
Best Practice for LLM Serving in the Cloud
PDF
Meet You in the Middle: 1000x Performance for Parquet Queries on PB-Scale Dat...
PDF
How Coupang Leverages Distributed Cache to Accelerate ML Model Training
PDF
Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...
PDF
AI/ML Infra Meetup | Building Production Platform for Large-Scale Recommendat...
PDF
AI/ML Infra Meetup | How Uber Optimizes LLM Training and Finetune
PDF
AI/ML Infra Meetup | Optimizing ML Data Access with Alluxio: Preprocessing, ...
PDF
AI/ML Infra Meetup | Deployment, Discovery and Serving of LLMs at Uber Scale
PDF
Alluxio Webinar | What’s New in Alluxio AI: 3X Faster Checkpoint File Creatio...
PDF
AI/ML Infra Meetup | A Faster and More Cost Efficient LLM Inference Stack
PDF
AI/ML Infra Meetup | Balancing Cost, Performance, and Scale - Running GPU/CPU...
PDF
Alluxio Webinar | Accelerate AI: Alluxio 101
PDF
AI/ML Infra Meetup | The power of Ray in the era of LLM and multi-modality AI
PDF
AI/ML Infra Meetup | Exploring Distributed Caching for Faster GPU Training wi...
PDF
AI/ML Infra Meetup | Big Data and AI, Zoom Developers
PDF
AI/ML Infra Meetup | TorchTitan, One-stop PyTorch native solution for product...
Introduction to Apache Iceberg™ & Tableflow
Optimizing Tiered Storage for Low-Latency Real-Time Analytics at AI Scale
Meet in the Middle: Solving the Low-Latency Challenge for Agentic AI
From Data Preparation to Inference: How Alluxio Speeds Up AI
Best Practice for LLM Serving in the Cloud
Meet You in the Middle: 1000x Performance for Parquet Queries on PB-Scale Dat...
How Coupang Leverages Distributed Cache to Accelerate ML Model Training
Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...
AI/ML Infra Meetup | Building Production Platform for Large-Scale Recommendat...
AI/ML Infra Meetup | How Uber Optimizes LLM Training and Finetune
AI/ML Infra Meetup | Optimizing ML Data Access with Alluxio: Preprocessing, ...
AI/ML Infra Meetup | Deployment, Discovery and Serving of LLMs at Uber Scale
Alluxio Webinar | What’s New in Alluxio AI: 3X Faster Checkpoint File Creatio...
AI/ML Infra Meetup | A Faster and More Cost Efficient LLM Inference Stack
AI/ML Infra Meetup | Balancing Cost, Performance, and Scale - Running GPU/CPU...
Alluxio Webinar | Accelerate AI: Alluxio 101
AI/ML Infra Meetup | The power of Ray in the era of LLM and multi-modality AI
AI/ML Infra Meetup | Exploring Distributed Caching for Faster GPU Training wi...
AI/ML Infra Meetup | Big Data and AI, Zoom Developers
AI/ML Infra Meetup | TorchTitan, One-stop PyTorch native solution for product...
Ad

Recently uploaded (20)

PPT
JAVA ppt tutorial basics to learn java programming
PPTX
Introduction to Artificial Intelligence
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 41
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PPTX
Odoo POS Development Services by CandidRoot Solutions
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PDF
System and Network Administraation Chapter 3
PDF
AI in Product Development-omnex systems
PPTX
L1 - Introduction to python Backend.pptx
PDF
Understanding Forklifts - TECH EHS Solution
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PPTX
ManageIQ - Sprint 268 Review - Slide Deck
PDF
How to Migrate SBCGlobal Email to Yahoo Easily
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PPTX
Essential Infomation Tech presentation.pptx
PDF
PTS Company Brochure 2025 (1).pdf.......
JAVA ppt tutorial basics to learn java programming
Introduction to Artificial Intelligence
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
Internet Downloader Manager (IDM) Crack 6.42 Build 41
Operating system designcfffgfgggggggvggggggggg
How to Choose the Right IT Partner for Your Business in Malaysia
Odoo POS Development Services by CandidRoot Solutions
Design an Analysis of Algorithms I-SECS-1021-03
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
VVF-Customer-Presentation2025-Ver1.9.pptx
System and Network Administraation Chapter 3
AI in Product Development-omnex systems
L1 - Introduction to python Backend.pptx
Understanding Forklifts - TECH EHS Solution
Upgrade and Innovation Strategies for SAP ERP Customers
ManageIQ - Sprint 268 Review - Slide Deck
How to Migrate SBCGlobal Email to Yahoo Easily
Which alternative to Crystal Reports is best for small or large businesses.pdf
Essential Infomation Tech presentation.pptx
PTS Company Brochure 2025 (1).pdf.......

AI/ML Infra Meetup | RAYvolution - The Last Mile: Mastering AI Deployment with Ray

  • 1. Three Developments in AI Infra Robert Nishihara Co-creator of Ray, Co-founder of Anyscale
  • 2. My Background 2013: Began PhD in AI 2016: Started Ray along with Philipp Moritz and Ion Stoica 2019: Started Anyscale to commercialize Ray
  • 6. Three Industry Trends How the world has changed since we started Ray
  • 8. Three Industry Trends Scaling Laws ● Larger models, more compute, more data → Better results ● Top priority for many AI infra teams: “enable training on 100x more data” ● ML platform / infra teams are on the critical path ● Scaling data -> scaling compute
  • 9. Three Industry Trends Scaling Laws ● Larger models, more compute, more data → Better results ● Top priority for many AI infra teams: “enable training on 100x more data” ● ML platform / infra teams are on the critical path ● Scaling data -> scaling compute Question: How can I use more compute to get higher quality data?
  • 10. Three Industry Trends Scaling Laws ● Larger models, more compute, more data → Better results ● Top priority for many AI infra teams: “enable training on 100x more data” ● ML platform / infra teams are on the critical path ● Scaling data -> scaling compute AI Data Processing ● Data processing: SQL centric → AI centric ● Multimodal data (text, images, video, audio, …) ● New requirements: data intensive and GPU intensive
  • 11. Three Industry Trends Scaling Laws ● Larger models, more compute, more data → Better results ● Top priority for many AI infra teams: “enable training on 100x more data” ● ML platform / infra teams are on the critical path ● Scaling data -> scaling compute AI Data Processing ● Data processing: SQL centric → AI centric ● Multimodal data (text, images, video, audio, …) ● New requirements: data intensive and GPU intensive Question: How can I use AI to get more insights from my data?
  • 12. Three Industry Trends Scaling Laws ● Larger models, more compute, more data → Better results ● Top priority for many AI infra teams: “enable training on 100x more data” ● ML platform / infra teams are on the critical path ● Scaling data -> scaling compute AI Data Processing ● Data processing: SQL centric → AI centric ● Multimodal data (text, images, video, audio, …) ● New requirements: data intensive and GPU intensive Heterogeneity ● Multimodality, hardware accelerators, frameworks, clouds, models
  • 13. Three Industry Trends Scaling Laws ● Larger models, more compute, more data → Better results ● Top priority for many AI infra teams: “enable training on 100x more data” ● ML platform / infra teams are on the critical path ● Scaling data -> scaling compute AI Data Processing ● Data processing: SQL centric → AI centric ● Multimodal data (text, images, video, audio, …) ● New requirements: data intensive and GPU intensive Heterogeneity ● Multimodality, hardware accelerators, frameworks, clouds, models