Computer Science > Computation and Language

arXiv:2308.08241 (cs)

[Submitted on 16 Aug 2023 (v1), last revised 22 Feb 2024 (this version, v2)]

Title:TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series

Authors:Chenxi Sun, Hongyan Li, Yaliang Li, Shenda Hong

Abstract:This work summarizes two ways to accomplish Time-Series (TS) tasks in today's Large Language Model (LLM) context: LLM-for-TS (model-centric) designs and trains a fundamental large model, or fine-tunes a pre-trained LLM for TS data; TS-for-LLM (data-centric) converts TS into a model-friendly representation to enable the pre-trained LLM to handle TS data. Given the lack of data, limited resources, semantic context requirements, and so on, this work focuses on TS-for-LLM, where we aim to activate LLM's ability for TS data by designing a TS embedding method suitable for LLM. The proposed method is named TEST. It first tokenizes TS, builds an encoder to embed TS via instance-wise, feature-wise, and text-prototype-aligned contrast, where the TS embedding space is aligned to LLM embedding layer space, then creates soft prompts to make LLM more open to that embeddings, and finally implements TS tasks using the frozen LLM. We also demonstrate the feasibility of TS-for-LLM through theory and experiments. Experiments are carried out on TS classification, forecasting, and representation tasks using eight frozen LLMs with various structures and sizes. The results show that the pre-trained LLM with TEST strategy can achieve better or comparable performance than today's SOTA TS models and offer benefits for few-shot and generalization. By treating LLM as the pattern machine, TEST can endow LLM's ability to process TS data without compromising language ability. We hope that this study will serve as a foundation for future work to support TS+LLM progress.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2308.08241 [cs.CL]
	(or arXiv:2308.08241v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2308.08241

Submission history

From: Chenxi Sun [view email]
[v1] Wed, 16 Aug 2023 09:16:02 UTC (5,525 KB)
[v2] Thu, 22 Feb 2024 02:03:42 UTC (5,607 KB)

Computer Science > Computation and Language

Title:TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators