DABL: Detecting Semantic Anomalies in Business Processes Using Large Language Models
Detecting anomalies in business processes is crucial for ensuring operational success. Many existing methods rely on statistical frequency to detect anomalies, but infrequent behavior is not necessarily undesirable. Detecting anomalies from a semantic viewpoint is therefore a more effective approach. However, current semantic anomaly detection methods treat a trace (i.e., a process instance) as multiple event pairs, disrupting long-distance dependencies. In this paper, we introduce DABL, a novel approach for detecting semantic anomalies in business processes using large language models (LLMs). We collect 143,137 real-world process models from various domains. By generating normal traces through the playout of these process models and simulating both ordering and exclusion anomalies, we fine-tune Llama 2 on the resulting log. Furthermore, DABL can interpret the causes of anomalies in natural language, providing valuable insights into the detected anomalies. Through extensive experiments, we demonstrate that DABL surpasses existing state-of-the-art semantic anomaly detection methods in terms of both generalization ability and learning of given processes. Additionally, users can directly apply DABL to detect semantic anomalies in their own datasets without additional training.

Datasets

To effectively fine-tune LLMs into a generic model capable of detecting semantic anomalies in business processes, a log meeting the following criteria is required: i) it must contain both normal and anomalous traces, ii) it should carry rich semantic information (i.e., activities should not be represented by meaningless characters), and iii) its traces should stem from diverse processes across various domains. Since no such log is available in the real world, we generate normal traces by playing out real-world process models from the BPM Academic Initiative (BPMAI), Fundamentals of Business Process Management (FBPM), and SAP Signavio Academic Models (SAP-SAM). These process models cover a broad range of domains, including common processes related to order and request handling as well as specialized processes from fields such as software engineering and healthcare. We then simulate synthetic anomalies from these normal traces.
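To make the two simulated anomaly types concrete, here is a minimal illustrative sketch. The function names, the example trace, and the mutation logic are simplifying assumptions for exposition, not the repository's exact simulation code (see dataset_pre.py for that):

```python
import random

def simulate_ordering_anomaly(trace):
    """Swap two randomly chosen events so the trace violates the
    ordering constraints of the underlying process model."""
    t = list(trace)
    i, j = random.sample(range(len(t)), 2)
    t[i], t[j] = t[j], t[i]
    return t

def simulate_exclusion_anomaly(trace, excluded_activity):
    """Insert an activity that the model excludes on this execution path
    (e.g., the other branch of an XOR split)."""
    t = list(trace)
    t.insert(random.randrange(len(t) + 1), excluded_activity)
    return t

# Hypothetical normal trace from an order-handling process.
normal = ["receive order", "check stock", "confirm order", "ship goods"]
print(simulate_ordering_anomaly(normal))
print(simulate_exclusion_anomaly(normal, "reject order"))
```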

Test Datasets

We allocate 1,000 process models for generating the test dataset $\mathcal{D}_1$. These models produce 14,387 normal traces, and we randomly simulate anomalies, resulting in 13,694 anomalous traces. In total, the test dataset comprises 28,081 traces.

From the 143,137 process models used to generate the training dataset, we randomly select 1,000 to create the test dataset $\mathcal{D}_2$. These 1,000 process models produce 21,298 normal traces, and we randomly simulate anomalies, resulting in 19,627 anomalous traces. In total, the test dataset comprises 40,925 traces. Note that although the normal traces in $\mathcal{D}_2$ also appear in the training dataset, the simulated anomalies do not.

In summary, the test dataset $\mathcal{D}_1$ evaluates the model's generalization ability, verifying whether it can detect anomalies in unseen processes. The test dataset $\mathcal{D}_2$ validates the model's performance on seen processes with unseen anomalies (i.e., learning of the given processes).

Experiment Results

Semantic anomaly detection results on dataset $\mathcal{D}_1$. The best results are shown in bold.

| Method | Prec. (%) | Rec. (%) | F$_1$ (%) | Acc. (%) |
|--------|-----------|----------|-----------|----------|
| SEM | 48.67 | 46.80 | 47.72 | 50.81 |
| SENSE-SVM | 87.95 | 1.12 | 2.20 | 52.50 |
| SENSE-BERT | 48.17 | **97.74** | 64.53 | 48.47 |
| DABL | **94.06** | 89.79 | **91.88** | **92.39** |

Semantic anomaly detection results on dataset $\mathcal{D}_2$. The best results are shown in bold.

| Method | Prec. (%) | Rec. (%) | F$_1$ (%) | Acc. (%) |
|--------|-----------|----------|-----------|----------|
| SEM | 71.91±1.24 | 48.63±1.38 | 58.02±1.15 | 66.75±1.34 |
| SENSE-SVM | 90.28±2.18 | 28.64±1.23 | 43.49±1.56 | 64.82±1.82 |
| SENSE-BERT | 93.16±0.87 | 62.88±0.58 | 75.08±0.72 | 80.28±0.84 |
| DABL | **98.12±0.00** | **95.64±0.00** | **96.87±0.00** | **97.03±0.00** |

Using Our Code to Reproduce the Results

1. Create the conda environment:

   ```bash
   conda install --yes --file requirements.txt
   # You may need to downgrade torch via pip to match your CUDA version.
   ```
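   To see which CUDA build of torch you currently have, a quick generic check (plain Python, not a repository script):

   ```python
   import torch

   # Report the torch version, the CUDA version it was built against,
   # and whether a GPU is actually visible at runtime.
   print("torch:", torch.__version__)
   print("built for CUDA:", torch.version.cuda)
   print("CUDA available:", torch.cuda.is_available())
   ```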
2. Download the process model datasets and save them to /dataset/process_model:

   ```
   ├── process_model
   │ ├── BPMAI
   │ │ ├── description.txt
   │ │ └── models
   │ ├── FBPM2-ProcessModels
   │ │ ├── Chapter1
   │ │ ├── ...
   │ │ ├── Chapter11
   │ │ └── info.txt
   │ └── sap_sam_2022
   │   ├── models
   │   ├── LICENSE.txt
   │   └── README.txt
   ```
3. Download the open-source LLM Llama-2-13b-chat-hf and save it to the root directory:

   ```
   ├── llama-2-13b-chat-hf
   │ ├── config.json
   │ ├── generation_config.json
   │ ├── LICENSE.txt
   │ ├── model-00001-of-00003.safetensors
   │ ├── model-00002-of-00003.safetensors
   │ ├── model-00003-of-00003.safetensors
   │ ├── model.safetensors.index.json
   │ ├── special_tokens_map.json
   │ ├── tokenizer.json
   │ ├── tokenizer.model
   │ └── tokenizer_config.json
   ```
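   One possible way to fetch the weights into this layout, assuming you have been granted access to the gated Llama 2 repository on Hugging Face:

   ```python
   from huggingface_hub import snapshot_download

   # Requires prior access approval for the gated Llama 2 repo and a login
   # via `huggingface-cli login` (or an explicit token=... argument).
   snapshot_download(
       repo_id="meta-llama/Llama-2-13b-chat-hf",
       local_dir="llama-2-13b-chat-hf",
   )
   ```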
4. Generate the datasets.
   - Run `python dataset_pre.py` from the root directory.
   - We provide the test datasets $\mathcal{D}_1$ and $\mathcal{D}_2$ in the `dataset` folder. The file `test_dataset_1.jsonl` is test dataset $\mathcal{D}_1$ for evaluating anomaly detection performance, while `test_dataset_cause_1.jsonl`, which contains only anomalies, is test dataset $\mathcal{D}_1$ for evaluating the interpretation of anomaly causes. Corresponding files exist for test dataset $\mathcal{D}_2$. A quick way to inspect these files is sketched below.
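   A minimal way to peek at one record; the field names are whatever `dataset_pre.py` emits, so the snippet prints them rather than assuming them:

   ```python
   import json

   # Load the first record of the provided test split and list its fields.
   with open("dataset/test_dataset_1.jsonl", encoding="utf-8") as f:
       first = json.loads(next(f))

   print(sorted(first.keys()))
   print(first)
   ```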
5. Fine-tune Llama 2. This step can be skipped by directly using our fine-tuned model (llama-13b-int4-dolly).
   - Run `python fine-tune-BPAD.py` from the root directory to obtain the fine-tuned model (llama-13b-int4-dolly); the general 4-bit + LoRA setup is sketched below.
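   For orientation only: a generic sketch of loading the base model in 4-bit and attaching LoRA adapters, the usual recipe behind an int4 fine-tune. The hyperparameters and target modules are illustrative assumptions, not the values used in `fine-tune-BPAD.py`:

   ```python
   import torch
   from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
   from peft import LoraConfig, get_peft_model

   # Quantize the base weights to 4-bit (int4 loading via bitsandbytes).
   bnb_config = BitsAndBytesConfig(
       load_in_4bit=True,
       bnb_4bit_quant_type="nf4",
       bnb_4bit_compute_dtype=torch.bfloat16,
   )

   tokenizer = AutoTokenizer.from_pretrained("llama-2-13b-chat-hf")
   model = AutoModelForCausalLM.from_pretrained(
       "llama-2-13b-chat-hf",
       quantization_config=bnb_config,
       device_map="auto",
   )

   # Attach LoRA adapters so only a small set of weights is trained.
   lora_config = LoraConfig(
       r=16,
       lora_alpha=32,
       target_modules=["q_proj", "v_proj"],
       lora_dropout=0.05,
       task_type="CAUSAL_LM",
   )
   model = get_peft_model(model, lora_config)
   model.print_trainable_parameters()
   ```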
6. Evaluate on the test datasets.
   - Run `python eval-BPAD.py` from the root directory; the reported metrics can be computed as sketched below.
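   A toy illustration of how the reported metrics relate to binary labels (1 = anomalous, 0 = normal), assuming scikit-learn is available. In practice `y_true` comes from the test JSONL labels and `y_pred` from parsing the model's generated answers:

   ```python
   from sklearn.metrics import accuracy_score, precision_recall_fscore_support

   # Hypothetical labels and predictions for six traces.
   y_true = [1, 0, 1, 1, 0, 0]
   y_pred = [1, 0, 1, 0, 0, 1]

   prec, rec, f1, _ = precision_recall_fscore_support(
       y_true, y_pred, average="binary", pos_label=1
   )
   acc = accuracy_score(y_true, y_pred)
   print(f"Prec {prec:.2%}  Rec {rec:.2%}  F1 {f1:.2%}  Acc {acc:.2%}")
   ```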

Running Tests on Your Own Real-World Event Log in '.xes' Format

1. Perform semantic anomaly detection by executing the following command from the root directory:

   ```bash
   python test_realLog.py --data_path dataset/BPIC20_PermitLog.xes
   ```

This command directly applies our fine-tuned model (llama-13b-int4-dolly) to your real-world event log in '.xes' format.

We provide two real-world logs as zip files: 'BPIC20_PermitLog.zip' and 'Road_Traffic_Fine_Management_Process.zip'. Unzipping them yields 'BPIC20_PermitLog.xes' and 'Road_Traffic_Fine_Management_Process.xes', which you can use to run your own tests.
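If you want to inspect what such a log contains before running the script, here is a generic sketch using pm4py (not part of the repository's code; recent pm4py versions return a pandas DataFrame with standard XES columns):

```python
import pm4py

# Read the event log and view each trace as its sequence of activity labels.
log = pm4py.read_xes("dataset/BPIC20_PermitLog.xes")
traces = (
    log.sort_values("time:timestamp")
       .groupby("case:concept:name")["concept:name"]
       .apply(list)
)
print(traces.iloc[0])  # activity sequence of the first case
```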

To Cite Our Paper

```bibtex
@article{guan2024dabl,
  title={DABL: Detecting Semantic Anomalies in Business Processes Using Large Language Models},
  author={Guan, Wei and Cao, Jian and Gao, Jianqi and Zhao, Haiyan and Qian, Shiyou},
  journal={arXiv preprint arXiv:2406.15781},
  year={2024}
}
```
