The One Token Model: A Multi-Layer Framework for the Granular Estimation of AI Inference Energy
APA
(2026). The One Token Model: A Multi-Layer Framework for the Granular Estimation of AI Inference Energy. SciVideos. https://videos.cern.ch/record/3025615
MLA
The One Token Model: A Multi-Layer Framework for the Granular Estimation of AI Inference Energy. SciVideos, May. 07, 2026, https://videos.cern.ch/record/3025615
BibTex
@misc{ scivideos_oai:cds.cern.ch:3025615,
doi = {},
url = {https://videos.cern.ch/record/3025615},
author = {},
keywords = {},
language = {en},
title = {The One Token Model: A Multi-Layer Framework for the Granular Estimation of AI Inference Energy},
publisher = {},
year = {2026},
month = {may},
note = {oai:cds.cern.ch:3025615 see, \url{https://scivideos.org/cern-cds/3025615}}
}
Francois, Mathieu
Talk numberoai:cds.cern.ch:3025615
Source RepositoryCERN-CDS
Collection
Subject
Abstract
The integration of Large Language Models (LLMs) into research workflows introduces a largely opaque layer of carbon intensity. Existing approaches to estimating AI energy consumption rely on time-based heuristics or static hardware profiling, which fail to capture the non-deterministic nature of generative inference. Variations in prompt design, quantization, and decoding strategies can lead to significant fluctuations in energy use, limiting the effectiveness of current sustainability assessments. This paper introduces the One Token Model (OTM), a unified framework that redefines energy measurement through output-normalized attribution, expressed as Joules per token. OTM integrates telemetry across three layers: infrastructure dynamics, model architecture, and inference behavior. We validate OTM through a real-time monitoring system that quantifies the marginal energy cost of individual inference requests. By enabling fine-grained, comparable measurements across systems, OTM supports energy-aware optimization and promotes more sustainable, transparent research computing practices.00:00:00 Slide 1
00:00:45 Slide 2
00:03:12 Slide 3
00:04:01 Slide 4
00:05:21 Slide 5
00:07:47 Slide 6
00:08:16 Slide 7
00:08:50 Slide 8
00:09:57 Slide 9
00:10:37 Slide 10
00:11:28 Slide 11
00:14:09 Slide 12
00:17:05 Slide 13
00:18:12 Slide 14
00:19:09 Slide 15
00:26:58 Slide 16
00:28:50 Slide 17