Nezami, Z. orcid.org/0000-0002-5962-5908, Hafeez, M., Djemame, K. et al. (1 more author) (2025) Generative AI on the Edge: Architecture and Performance Evaluation. ICC 2025 - IEEE International Conference on Communications. pp. 4595-4602. ISSN: 1550-3607
Abstract
6G's AI native vision of embedding advance intelligence in the network while bringing it closer to the user requires a systematic evaluation of Generative AI (GenAI) models on edge devices. Rapidly emerging solutions based on Open RAN (ORAN) and Network-in-aBox strongly advocate the use of low-cost, off-the-shelf components for simpler and efficient deployment, for example, in provisioning rural connectivity. In this context, conceptual architecture, hardware testbeds, and precise performance quantification of Large Language Models (LLMs) on off-the shelf edge devices remain largely unexplored. This research investigates computationally demanding LLM inference on a single commodity Raspberry Pi serving as an edge testbed for ORAN. We investigate various LLMs, including small, medium, and large models, on a Raspberry Pi 5 Cluster using a lightweight Kubernetes distribution (K3s) with modular prompting implementation. We study its feasibility and limitations by analyzing throughput, latency, accuracy, and efficiency. Our findings indicate that CPU-only deployment of lightweight models, such as Yi, Phi, and Llama3, can effectively support edge applications, achieving a generation throughput of 5 to 12 tokens per second with less than 50% CPU and RAM usage. We conclude that GenAI on the edge offers localized inference in remote or bandwidthconstrained environments in 6 G networks without reliance on cloud infrastructure.
Metadata
| Item Type: | Article |
|---|---|
| Authors/Creators: |
|
| Keywords: | 6G, GenAI, Large Language Model, Kubernetes, Edge AI |
| Dates: |
|
| Institution: | The University of Leeds |
| Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Electronic & Electrical Engineering (Leeds) |
| Date Deposited: | 17 Oct 2025 07:46 |
| Last Modified: | 17 Oct 2025 14:32 |
| Published Version: | https://ieeexplore.ieee.org/document/11161569 |
| Status: | Published |
| Publisher: | IEEE |
| Identification Number: | 10.1109/icc52391.2025.11161569 |
| Sustainable Development Goals: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:232968 |


CORE (COnnecting REpositories)
CORE (COnnecting REpositories)