Unified Flash Memory and Reduced HBM Are Reshaping AI Training and Inference with Phison

AI will need less HBM (high-bandwidth memory) because flash memory unification is changing how training and inference are done. This episode of the Tech Field Day podcast features Sebastien Jean from Phison, Max Mortillaro, Brian Martin, and Alastair Cooke. Training, fine-tuning, and inference with large language models have traditionally required GPUs with enough HBM to hold the entire model and data set. Phison’s aiDaptiv+ framework extends GPU memory with flash, letting users trade training speed for lower infrastructure cost, or support larger data sets (context) for inference. This approach enables users to balance cost, compute, and memory needs, making larger models accessible without top-of-the-line GPUs and giving smaller companies more access to generative AI.
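The general mechanism behind this kind of memory tiering is easier to see in code. The sketch below is a hypothetical illustration only, not Phison’s aiDaptiv+ API: a small “resident budget” of layers stands in for HBM, the remaining layers live on flash and are loaded on demand during a layer-by-layer forward pass. All names here (`TieredLayerStore`, `resident_budget`) are invented for illustration, and real systems overlap I/O with compute rather than loading synchronously.

```python
import os
import tempfile
import numpy as np


class TieredLayerStore:
    """Toy model of HBM/flash tiering: keep a small budget of layers
    resident in fast memory, spill the rest to files on flash."""

    def __init__(self, num_layers, layer_shape, resident_budget):
        self.resident_budget = resident_budget
        self.resident = {}   # layer_id -> array in RAM (stand-in for HBM)
        self.flash = {}      # layer_id -> file path (stand-in for flash tier)
        self.dir = tempfile.mkdtemp(prefix="flash_tier_")
        rng = np.random.default_rng(0)
        for i in range(num_layers):
            path = os.path.join(self.dir, f"layer_{i}.npy")
            np.save(path, rng.standard_normal(layer_shape).astype(np.float32))
            self.flash[i] = path

    def get(self, layer_id):
        """Return a layer's weights, loading from flash on a miss."""
        if layer_id in self.resident:
            return self.resident[layer_id]
        weights = np.load(self.flash[layer_id])  # read from flash into RAM
        if len(self.resident) >= self.resident_budget:
            # Evict the oldest resident layer (simple FIFO policy).
            self.resident.pop(next(iter(self.resident)))
        self.resident[layer_id] = weights
        return weights


# Sequential forward pass: only 2 of 8 layers fit in "HBM" at once.
store = TieredLayerStore(num_layers=8, layer_shape=(256, 256), resident_budget=2)
x = np.random.default_rng(1).standard_normal(256).astype(np.float32)
for i in range(8):
    # Slower than keeping everything in fast memory, but the model size
    # is now bounded by flash capacity rather than by HBM capacity.
    x = np.tanh(store.get(i) @ x)
print(x.shape)
```

The eviction policy and load path are where the cost/speed trade-off lives: a larger resident budget (more HBM) means fewer flash reads and faster passes, while a smaller budget lets the same hardware handle a much larger model at reduced speed.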

Panelists

Alastair Cooke

@DemitasseNZ

Alastair is a Tech Field Day event lead at The Futurum Group, specializing in Cloud, DevOps, and Edge.

Brian Martin

Vice President of AI and Datacenter Performance at Signal65

Max Mortillaro

Max Mortillaro is an independent data center consultant specializing in virtualization and storage technologies.