How DPUs accelerate AI/ML

News Date

Data processing units (DPUs) are the advanced forms of programmable network adapters that act as an interface between the server and the rest of the network in datacenters to help improve server utilization and power consumption. Hyperscale cloud providers have been using them for years to offload various functions. But these benefits are not limited to hyperscalers. This is particularly relevant when it comes to hosting and processing the large language models (LLMs) associated with generative artificial intelligence (AI) and machine learning (ML) enabled applications that put considerable strain on system CPUs and GPUs.