Computer Science > Cryptography and Security

arXiv:2312.09494 (cs)

[Submitted on 15 Dec 2023 (v1), last revised 18 Dec 2023 (this version, v2)]

Title: No-Skim: Towards Efficiency Robustness Evaluation on Skimming-based Language Models

Authors: Shengyao Zhang, Mi Zhang, Xudong Pan, Min Yang

Abstract: To reduce the computation cost and energy consumption of large language models (LLMs), skimming-based acceleration progressively drops unimportant tokens of the input sequence along the layers of the LLM while preserving the semantically important tokens. However, our work reveals for the first time that this acceleration may be vulnerable to Denial-of-Service (DoS) attacks. In this paper, we propose No-Skim, a general framework that helps the owners of skimming-based LLMs understand and measure the robustness of their acceleration scheme. Specifically, our framework searches for minimal and unnoticeable perturbations at the character level and the token level to generate adversarial inputs that sufficiently increase the remaining token ratio, thus increasing the computation cost and energy consumption. We systematically evaluate the vulnerability of skimming acceleration in various LLM architectures, including BERT and RoBERTa, on the GLUE benchmark. In the worst case, the perturbation found by No-Skim increases the running cost of the LLM by over 145% on average. Moreover, No-Skim extends the evaluation framework to various scenarios, so that the evaluation can be conducted with different levels of knowledge.
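As a toy illustration of the quantity the abstract refers to, the sketch below drops tokens layer by layer and computes a remaining token ratio. It is an assumption-laden example, not the paper's method: the names `skim` and `remaining_token_ratio`, the per-layer importance scores, the fixed threshold, and the averaging over layers are all hypothetical, since the abstract does not specify how the skimming decision or the ratio is defined.

```python
from typing import List


def skim(scores_per_layer: List[List[float]], threshold: float) -> List[int]:
    """Return how many tokens are still kept after each layer.

    scores_per_layer[l][i] is a hypothetical importance score for token i
    at layer l. A token dropped at one layer stays dropped afterwards.
    """
    n_tokens = len(scores_per_layer[0])
    alive = [True] * n_tokens
    kept_counts = []
    for scores in scores_per_layer:
        # A token survives only if it was alive and still scores high enough.
        alive = [a and s >= threshold for a, s in zip(alive, scores)]
        kept_counts.append(sum(alive))
    return kept_counts


def remaining_token_ratio(kept_counts: List[int], n_tokens: int) -> float:
    """Average fraction of tokens processed per layer (assumed definition)."""
    return sum(kept_counts) / (len(kept_counts) * n_tokens)


# Benign input: most tokens are judged unimportant and get dropped.
benign = [[0.9, 0.2, 0.8, 0.1], [0.9, 0.9, 0.3, 0.9]]
# Hypothetical adversarial input: every token looks important at every layer.
adversarial = [[0.9, 0.9, 0.9, 0.9], [0.9, 0.9, 0.9, 0.9]]

benign_ratio = remaining_token_ratio(skim(benign, 0.5), 4)
adversarial_ratio = remaining_token_ratio(skim(adversarial, 0.5), 4)
print(benign_ratio, adversarial_ratio)
```

Under these assumptions, a higher ratio means more tokens pass through more layers, which is the sense in which an adversarial input erodes the savings that skimming is meant to provide.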

Subjects:	Cryptography and Security (cs.CR); Computation and Language (cs.CL)
Cite as:	arXiv:2312.09494 [cs.CR]
	(or arXiv:2312.09494v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2312.09494

Submission history

From: Shengyao Zhang
[v1] Fri, 15 Dec 2023 02:42:05 UTC (1,514 KB)
[v2] Mon, 18 Dec 2023 02:50:02 UTC (1,514 KB)
