Readers interested in this problem scope are welcome to contact the authors.
This version uses the training databases from Bird (69 databases) and Spider (166 databases), for a total of 235 databases. Each database targets a specific domain and consists of multiple tables.
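Both benchmarks ship their databases as SQLite files. As a rough illustration of how the 235 databases and their tables could be enumerated, here is a small sketch; the directory layout and the `.sqlite` extension are assumptions, not part of this project's actual loader.

```python
import sqlite3
from pathlib import Path

def list_tables(db_path: str) -> list[str]:
    """Return the user table names in one SQLite database file."""
    with sqlite3.connect(db_path) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type='table' ORDER BY name"
        ).fetchall()
    return [name for (name,) in rows]

def enumerate_databases(root: str) -> dict[str, list[str]]:
    """Map each database name (file stem) to its tables, scanning a tree."""
    return {
        p.stem: list_tables(str(p))
        for p in sorted(Path(root).rglob("*.sqlite"))
    }
```

With the Bird and Spider database roots in place, `enumerate_databases("data/")` would yield one entry per domain database, each listing its tables.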
The pipeline implements a 9-step workflow with forward and backward passes to improve SQL generation quality:
- Setup Sampler: Initialize diversity sampling based on batch_size and num_iteration
- Generate Query (Forward): Natural language query generation from database schema
- Generate Groundtruth (Forward): SQL generation from natural language query
- Verify Format: Execute SQL and materialize results to verdict database
- Verify Groundtruth (Forward): LLM-based verification of query-SQL correctness and adherence
- Generate Unit Test (Backward): Create comprehensive unit tests from SQL result table
- Generate Query (Backward): Generate improved natural language query from SQL + result table
- Verify Again (Backward): Execute unit tests for final verification
- Save to Dataset: Convert to dataset.jsonl if verdict is "correct" and adherence is "adheres" or "partial"
The pipeline supports batch execution and iterations:
- Batch Size: Number of parallel pipeline runs per iteration (default: 5)
- Iterations: Number of batch runs to execute (default: 1)
- Diversity Sampling: Intelligent sampling for specification diversity across batches
- Execution Logging: Full process logs saved to the `execution_log/` directory
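The batch and iteration settings above compose as `batch_size` pipeline runs per iteration, repeated `num_iterations` times. A minimal stand-in for the diversity sampler, assuming specifications are drawn without repetition until the pool is exhausted (the actual sampler's strategy may differ):

```python
import random

def sample_batches(specs: list[str], batch_size: int = 5,
                   num_iterations: int = 1, seed: int = 0) -> list[list[str]]:
    """Draw `batch_size` distinct specs per iteration, avoiding repeats
    across iterations until the pool runs out, then reshuffling."""
    rng = random.Random(seed)
    pool = specs.copy()
    rng.shuffle(pool)
    batches = []
    for _ in range(num_iterations):
        if len(pool) < batch_size:  # pool exhausted: refill and reshuffle
            pool = specs.copy()
            rng.shuffle(pool)
        batches.append([pool.pop() for _ in range(batch_size)])
    return batches
```

With the defaults (`batch_size=5`, `num_iterations=1`), one call yields a single batch of five distinct specifications; larger `num_iterations` values keep drawing fresh specs before any repeat.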