CascadeRCG

CascadeRCG: Retrieval-Augmented Generation for Enhancing Professionalism and Knowledgeability in Online Mental Health Support

Paper Link

🌸 Vector Database

Features

Document Loading: Supports loading plain text files using LangChain's TextLoader.
Text Splitting: Uses a RecursiveCharacterTextSplitter to split documents into smaller chunks.
Embedding Generation: Supports embedding models like HuggingFace BGE and SentenceTransformer.
Storage: Stores embeddings in a Chroma vector database.

Example

python construct_vector_db.py -b /path/to/texts -m /path/to/model -c /path/to/chroma_db

Notice: Please be aware that due to copyright restrictions, the actual content of the database is not publicly available. However, our list of book names can refer to the books_list/list.json file.

☺️ Generation

pip install -r requirements.txt
python main.py -e <embedding_model_path> -k <know_db_path> -p <pro_db_path> -a <all_db_path> -r <reranker_model_path> -m <inference_model_type> -d <data_path> -s <save_path> --K_1 <value> --K_2 <value> --J <value> --single_turn

🔍 Evaluation

Criteria:

Evaluation Steps:

This tool evaluates data using the GPT-4 model. It supports two types of evaluations: "ethics" and "rag".

Setup

Set Up Environment Variables

Before running the tool, you need to set the following environment variables:
- OPENAI_API_KEY: Your OpenAI API key.
- OPENAI_API_BASE: The base URL for the OpenAI API.
You can set these variables in your terminal or command prompt:
```
export OPENAI_API_KEY='your-api-key'
export OPENAI_API_BASE='https://api.openai.com'
```

Usage

Run the script with the required arguments:

cd CascadeRCG/evaluation
python get_scores.py -e /path/to/evaluation_data.json -t [ethics|rag] -r /path/to/results.json

@inproceedings{10.1145/3701716.3715466,
author = {Yang, Di and Zhu, Jingwei and Wu, Haihong and Tan, Minghuan and Li, Chengming and Yang, Min},
title = {CascadeRCG: Retrieval-Augmented Generation for Enhancing Professionalism and Knowledgeability in Online Mental Health Support},
year = {2025},
isbn = {9798400713316},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3701716.3715466},
doi = {10.1145/3701716.3715466},
abstract = {Online mental health support(OMHS) plays a crucial role in promoting well-being, but the shortage of mental health professionals necessitates automated systems to address complex care needs. While large language models (LLMs) are widely adopted, they often fall short in OMHS settings due to the complexity and ambiguity of the questions posed. Additionally, providing accurate answers requires extensive knowledge, which LLMs may lack, leading to responses that often lack depth, professionalism, and critical detail. To address these limitations, we introduce a new task tailored to OMHS scenarios, focusing on enhancing the professionalism and knowledgeability of generated responses. Furthermore, we propose a comprehensive benchmark designed to systematically evaluate the quality of responses. Building on these foundations, we propose the CascadeRCG framework, an optimized approach based on Retrieval-Augmented Generation (RAG). This framework first employs a knowledge management strategy, then introduces a two-stage cross-iterative Retrieval mechanism and a Clustering-then-summarizing module, followed by the final Generation stage. Experimental results on both single-turn and multi-turn psychological dialogue datasets, compared to other RAG-based baselines across different LLMs, show significant improvements in response professionalism and knowledge depth. This enhancement in response quality provides an effective methodology and strategy for further improving OMHS systems. Our code is available at https://github.com/CAS-SIAT-XinHai/CascadeRCG.},
booktitle = {Companion Proceedings of the ACM on Web Conference 2025},
pages = {1465–1469},
numpages = {5},
keywords = {LLM, NLP, RAG, online mental health support},
location = {Sydney NSW, Australia},
series = {WWW '25}
}

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
books_list		books_list
evaluation		evaluation
images		images
prompts		prompts
src		src
vector_database		vector_database
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CascadeRCG

🌸 Vector Database

Features

Example

☺️ Generation

🔍 Evaluation

Criteria:

Evaluation Steps:

Setup

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CascadeRCG

🌸 Vector Database

Features

Example

☺️ Generation

🔍 Evaluation

Criteria:

Evaluation Steps:

Setup

Usage

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages