Commit 9b004c7
[TRTLLM-11421][feat] Add per-iteration KV cache statistics and Prometheus metrics

Add real-time, per-iteration KV cache monitoring across 3 layers:

C++ layer (kvCacheManager):
- KvCacheIterationStats struct with 18 fields: pool gauges (primary/secondary
  max/free/used), context-phase deltas (alloc total/new, reused full/partial,
  missed, hit rate), generation-phase deltas (gen alloc), and transfer deltas
  (onboard/offload blocks and bytes)
- getAndResetIterationStats() on BlockManager with atomic delta reset
- KvCacheTransferStats for onboard/offload byte counting
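
The get-and-reset pattern named above can be illustrated with a minimal sketch. It is written in Python for brevity rather than in the C++ of kvCacheManager, and it is not the commit's implementation: the class names, field names, the lock-based reset, and the hit-rate formula (reused / (reused + missed)) are all assumptions made for illustration.

```python
import threading
from dataclasses import dataclass


@dataclass
class IterationStats:
    # Gauges: absolute pool state at snapshot time (hypothetical field names).
    free_blocks: int = 0
    used_blocks: int = 0
    # Deltas: activity accumulated since the previous snapshot.
    reused_blocks: int = 0
    missed_blocks: int = 0
    hit_rate: float = 0.0


class DeltaTracker:
    """Hypothetical stand-in for a block manager that tracks per-iteration deltas."""

    def __init__(self, max_blocks: int) -> None:
        self._lock = threading.Lock()
        self._max_blocks = max_blocks
        self._used_blocks = 0
        self._reused = 0
        self._missed = 0

    def record_context_lookup(self, reused: int, missed: int) -> None:
        with self._lock:
            self._reused += reused
            self._missed += missed
            # Simplification: only missed blocks allocate new pool blocks.
            self._used_blocks += missed

    def get_and_reset(self) -> IterationStats:
        # Read and zero the deltas under one lock so no update is lost
        # or counted twice; gauges are read but never reset.
        with self._lock:
            reused, missed = self._reused, self._missed
            self._reused = self._missed = 0
            used = self._used_blocks
        total = reused + missed
        return IterationStats(
            free_blocks=self._max_blocks - used,
            used_blocks=used,
            reused_blocks=reused,
            missed_blocks=missed,
            hit_rate=reused / total if total else 0.0,  # assumed formula
        )
```

Calling `get_and_reset()` twice in a row returns zeroed deltas the second time while the gauges keep their values, which is the distinction the struct description draws between pool gauges and per-phase deltas.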

Python plumbing:
- Nanobind bindings for KvCacheIterationStats
- get_iteration_stats() on resource_manager, plumbed through PyExecutor
- Interval-gated collection via KvCacheConfig.iteration_stats_interval
- Stats serialization in base_worker as kvCacheIterationStats dict
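
Interval-gated collection could look roughly like the sketch below. Only the name `iteration_stats_interval` comes from the commit message. The `IntervalGate` class, the rule that a non-positive interval disables collection, and the choice to collect on every Nth iteration are assumptions, not the behavior of PyExecutor.

```python
from typing import Callable, Optional


class IntervalGate:
    """Hypothetical helper: run a stats collector only every `interval` iterations."""

    def __init__(self, interval: int) -> None:
        self.interval = interval
        self._iteration = 0

    def maybe_collect(self, collect: Callable[[], dict]) -> Optional[dict]:
        self._iteration += 1
        if self.interval <= 0:
            return None  # assumed: a non-positive interval disables collection
        if self._iteration % self.interval != 0:
            return None  # skip this iteration; the collector is not called
        return collect()
```

If the collector behind the gate is a get-and-reset call, skipped iterations would presumably fold into the next collected delta rather than being lost, since nothing resets the counters in between.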

Prometheus metrics (collector.py):
- 9 new metrics: pool gauges, reuse rate, missed/reused/gen-alloc counters,
  onboard/offload byte counters, keyed by window_size label
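
To show how per-iteration deltas map onto gauges and counters keyed by a `window_size` label, here is a stdlib-only sketch. It does not reproduce collector.py: a real collector would normally use a Prometheus client library, and the metric names, dict keys, and `KvCacheMetrics` class are hypothetical. Only the `name{label="value"} number` line format follows the Prometheus text exposition format.

```python
from collections import defaultdict


class KvCacheMetrics:
    """Hypothetical in-memory registry keyed by (metric name, window_size)."""

    GAUGES = ("free_blocks", "used_blocks")        # overwritten each iteration
    COUNTERS = ("reused_blocks", "missed_blocks")  # deltas added monotonically

    def __init__(self) -> None:
        self.gauges = {}
        self.counters = defaultdict(int)

    def observe(self, stats_by_window: dict) -> None:
        # stats_by_window maps a window size to one iteration's stats dict.
        for window_size, stats in stats_by_window.items():
            label = str(window_size)
            for name in self.GAUGES:
                self.gauges[(name, label)] = stats[name]
            for name in self.COUNTERS:
                self.counters[(name, label)] += stats[name]

    def render(self) -> str:
        # Emit samples in the Prometheus text exposition line format.
        lines = []
        for (name, label), value in sorted(self.gauges.items()):
            lines.append(f'kv_cache_{name}{{window_size="{label}"}} {value}')
        for (name, label), value in sorted(self.counters.items()):
            lines.append(f'kv_cache_{name}_total{{window_size="{label}"}} {value}')
        return "\n".join(lines)
```

Gauges are set because they describe the pool's current state, whereas the deltas are added to counters so that a scraper can derive rates regardless of how often stats are collected.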

Signed-off-by: Yueh-Ting Chen <yueh.ting.chen@gmail.com>

1 parent 68001ce commit 9b004c7
File tree
10 files changed, +434 -9 lines changed
- cpp
  - include/tensorrt_llm/batch_manager
  - tensorrt_llm
    - batch_manager
    - nanobind/batch_manager
- tensorrt_llm
  - _torch/pyexecutor
  - executor
  - llmapi
  - metrics
Lines changed: 25 additions & 0 deletions
Lines changed: 59 additions & 0 deletions