Computer Science > Machine Learning

arXiv:2408.08684 (cs)

[Submitted on 16 Aug 2024]

Title:Research on Personalized Compression Algorithm for Pre-trained Models Based on Homomorphic Entropy Increase

Abstract:In this article, we explore the challenges and evolution of two key technologies in the current field of AI: Vision Transformer model and Large Language Model (LLM). Vision Transformer captures global information by splitting images into small pieces and leveraging Transformer's multi-head attention mechanism, but its high reference count and compute overhead limit deployment on mobile devices. At the same time, the rapid development of LLM has revolutionized natural language processing, but it also faces huge deployment challenges. To address these issues, we investigate model pruning techniques, with a particular focus on how to reduce redundant parameters without losing accuracy to accommodate personalized data and resource-constrained environments. In this paper, a new layered pruning strategy is proposed to distinguish the personalized layer from the common layer by compressed sensing and random sampling, thus significantly reducing the model parameters. Our experimental results show that the introduced step buffering mechanism further improves the accuracy of the model after pruning, providing new directions and possibilities for the deployment of efficient and personalized AI models on mobile devices in the future.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2408.08684 [cs.LG]
	(or arXiv:2408.08684v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.08684

Submission history

From: Yicong Li [view email]
[v1] Fri, 16 Aug 2024 11:56:49 UTC (1,084 KB)

Computer Science > Machine Learning

Title:Research on Personalized Compression Algorithm for Pre-trained Models Based on Homomorphic Entropy Increase

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Research on Personalized Compression Algorithm for Pre-trained Models Based on Homomorphic Entropy Increase

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators