Investigation of Performance and Energy Consumption of Tokenization Algorithms on Multi-core CPUs Under Power Capping

Oksana Diakun; Jan Dobrosolski; Paweł Czarnul

doi:10.1007/978-3-031-71115-2_23

In this paper we investigate performance-energy optimization of tokenizer algorithm training using power capping. We focus on parallel, multi-threaded implementations of Byte Pair Encoding (BPE), Unigram, WordPiece, and WordLevel run on two systems with different multi-core CPUs: Intel Xeon 6130 and desktop Intel i7-13700K. We analyze execution times and energy consumption for various numbers of threads and various power caps and demonstrate that energy consumption can be minimized for both CPUs, while metrics such as EDP and EDS could be optimized for the i7-13700K CPU. We further show that percentage energy gain versus execution time loss could be optimized by 3–6% and 7–13%, depending on the algorithm, for the two CPUs respectively, by applying proper non-default power caps.

Autorzy

Pobierz publikację

Informacje dodatkowe

DOI: Cyfrowy identyfikator dokumentu elektronicznego link otwiera się w nowej karcie 10.1007/978-3-031-71115-2_23
Kategoria: Aktywność konferencyjna
Typ: publikacja w wydawnictwie zbiorowym recenzowanym (także w materiałach konferencyjnych)
Język: angielski
Rok wydania: 2024

Źródło danych: MOSTWiedzy.pl - publikacja "Investigation of Performance and Energy Consumption of Tokenization Algorithms on Multi-core CPUs Under Power Capping" link otwiera się w nowej karcie

link otwiera się w nowej karcie

Repozytorium publikacji - Politechnika Gdańska

Treść strony