This is the official repo for our FSE 2025 accepted paper: No More Labelled Examples? An Unsupervised Log Parser with LLMs
git clone [email protected]:Jun-jie-Huang/LUNAR.gitRun the following command to install the dependencies
conda create -n lunar python=3.8.19
conda activate lunar
cd LUNAR
pip install -r requirements.txtAdd your openai API key and base url by updating line 195 in ./LUNAR/config.py file
LLM_BASE_MAPPING={
"gpt35": ["gpt-3.5-turbo-0125", "API_BASE_URL", "API_KEY"],
}bash run_lunar.sh Apache
In this repo, we provide the Apache dataset as an example due to the github file size limitation. To experiment with more datasets, please follow the below steps.
You can download the other datasets in LogHub2.0 from this link and move them to ./dataset just like the Apache dataset.
The industrial parsing datasets, including, CTS, HiBench, and LoFI, can be found in ./dataset.
You can run the following command to evaluate on all datasets
bash run_lunar.sh all
You can also run an efficient version LUNAR-Parallel with the following steps:
Add your openai API key and base url by updating line 18-19 in main_parallel.py file
API_KEY="sk-API_KEY"
BASE_URL="API_BASE_URL"bash run_lunar_parallel.sh Apache
After you download the Loghub-2.0 datasets, you can use the following scripts to evaluate on all datasets
bash run_lunar_parallel.sh all
In our paper, we annotate a new log parsing dataset with industrial logs LoFI. The original logs are collected from our ISSRE-2024 paper Demystifying and Extracting Fault-indicating Information from Logs for Failure Diagnosis.
You can find our dataset at ./dataset/LoFI.
If you find our work useful, please consider citing our paper:
@article{huang2024lunar,
author={Huang, Junjie and Jiang, Zhihan and Chen, Zhuangbin and Lyu, Michael},
title={No More Labelled Examples? An Unsupervised Log Parser with LLMs},
year={2025},
issue_date={July 2025},
publisher={Association for Computing Machinery},
address={New York, NY, USA},
volume={2},
number={FSE},
url={https://doi-org.easyaccess1.lib.cuhk.edu.hk/10.1145/3729377},
doi={10.1145/3729377},
journal={Proc. ACM Softw. Eng.},
month=jun,
articleno={FSE107},
numpages={24}
}
@inproceedings{huang2024demystifying,
title={Demystifying and Extracting Fault-indicating Information from Logs for Failure Diagnosis},
author={Huang, Junjie and Jiang, Zhihan and Liu, Jinyang and Huo, Yintong and Gu, Jiazhen and Chen, Zhuangbin and Feng, Cong and Dong, Hui and Yang, Zengyin and Lyu, Michael R},
booktitle={2024 IEEE 35th International Symposium on Software Reliability Engineering (ISSRE)},
pages={511--522},
year={2024},
organization={IEEE}
}
