首页 /研究 /Energy and Cost Considerations for GPU Accelerated AI Inference Workloads

LEARNING

Energy and Cost Considerations for GPU Accelerated AI Inference Workloads

Tergel Molom-Ochir, Rohan Shenoy

发表年份: 2021
引用次数: 3

摘要

Recent advances in AI have motivated hardware manufacturers to design deep learning friendly accelerators to keep with the ever-growing increases in model sizes and compu-tational requirements. While early accelerators were utilized for model training, newer accelerators are capable of running deep neural network (DNN) model inferences and are increasingly used in robotics, vision, and edge applications. In this paper, we compare several popular embedded and desktop GPUs with respect to their performance and energy efficiency. Our results show that although larger devices always provide higher throughput, they are not always the most energy-efficient. GPUs vary in terms of their energy efficiency. To aid the process of hardware selection for a system designer, we use our experimental results to design a recommendation algorithm that chooses the ideal hardware accelerator under cost, power, and performance constraints.

关键词

Computer scienceHardware accelerationEfficient energy useInferenceDeep learningArtificial intelligenceProcess (computing)ThroughputEdge deviceArtificial neural network

Energy and Cost Considerations for GPU Accelerated AI Inference Workloads

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory