This work investigates the transition from traditional Time Series Forecasting (TSF) to Large Time Series Models (LTSMs), leveraging universal transformer-based models. Training LTSMs on diverse time series data introduces challenges due to varying frequencies, dimensions, and patterns. We explore various design choices for LTSMs, including pre-processing, model configurations, and dataset setups. We introduce Time Series Prompt, a statistical prompting strategy, and
🌟 Please star our repo to follow the latest updates on LTSM-bundle!
📣 We have released our paper and source code of LTSM-bundle-v1.0!
📚 Follow our latest English Tutorial or 中文教程 to costomize your LTSM!
🌎 For more information, please visit:
- Paper: https://arxiv.org/abs/2406.14045
- Blog: Time Series Are Not That Different for LLMs
- Tutorial: Build your own LTSM-bundle
- Chinese Tutorial: https://zhuanlan.zhihu.com/p/708804309
- Do you want to learn more about data pipeline search? Please check out our data-centric AI survey and data-centric AI resources !
The LTSM-bundle package leverages the HuggingFace transformers toolkit, offering flexibility to switch between different advanced language models as the backbone. It is easy to tailor the general LTSMs to their specific time series forecasting needs by selecting the most suitable language model from a wide array of options. The flexibility enhances the adaptability of the package across different industries and data types, ensuring optimal performance in diverse scenarios.
conda create -n ltsm python=3.8.0
conda activate ltsm
git clone git@github.com:daochenzha/ltsm.git
cd ltsm
pip3 install -e .
pip3 install -r requirements.txt
Training on [Time Series Prompt] and [Linear Tokenization]
bash scripts/train_ltsm_csv.sh
Training on [Text Prompt] and [Linear Tokenization]
bash scripts/train_ltsm_textprompt_csv.sh
Training on [Time Series Prompt] and [Time Series Tokenization]
bash scripts/train_ltsm_tokenizer_csv.sh
Download the datasets
cd datasets
download: https://drive.google.com/drive/folders/1hLFbz0FRxdiDCzgFYtKCOPJYSBVvwW9P
Download the time series prompts
cd prompt_bank/propmt_data_csv
download: https://drive.google.com/drive/folders/1hLFbz0FRxdiDCzgFYtKCOPJYSBVvwW9P
If you find this work useful, you may cite this work:
@article{ltsm-bundle,
title={Understanding Different Design Choices in Training Large Time Series Models},
author={Chuang*, Yu-Neng and Li*, Songchen and Yuan*, Jiayi and Wang*, Guanchu and Lai*, Kwei-Herng and Yu, Leisheng and Ding, Sirui and Chang, Chia-Yuan and Tan, Qiaoyu and Zha, Daochen and Hu, Xia},
journal={arXiv preprint arXiv:2406.14045},
year={2024}
}