OpenLATTE

OpenLATTE is a reimplementation of the LATTE (LLM-Powered Static Binary Taint Analysis) static analysis pipeline for discovering vulnerabilities in stripped binaries. The project automates the three major phases described in the paper:

Function classification – external library calls are labelled as potential taint sources or sinks using an LLM.
Flow extraction – Ghidra scripts identify vulnerable destinations and build call chains that trace the flow of tainted data.
LLM inspection – the discovered flows are analysed by an LLM to report possible vulnerabilities. The rag.py script optionally augments these prompts with a retrieval‑augmented knowledge base (RAG).

The repository contains helper scripts for running Ghidra in headless mode, exporting code for a knowledge base and querying either a local Ollama model or Google Gemini.

Repository layout

build/                 # Compiled test binaries and symbol maps
results/               # JSON output of each analysis stage
ghidra-workspace/      # Ghidra projects created during headless runs
*.py, *.sh             # Analysis scripts used in each LATTE phase

Requirements

Python 3.9+
Ghidra 11.3.2 with the Ghidrathon 4.0 plugin
A running LLM backend
- Local model via Ollama (classifyLocal.py, inspect_flows_with_llm.py)
- Google Gemini for higher quality results (classifyGemini.py, rag.py)
pip install -r requirement.txt

Several scripts expect environment variables such as GOOGLE_API_KEY or GOOGLE_API_KEY42 to be set with your Gemini key.

Basic workflow

Export external functions

./flow.sh /path/to/binary.out   # runs Ghidra to dump external functions

Classify as sources or sinks

python3 batch_classify.py --ext-funcs build/external_funcs_<binary>.out.txt \
    --mode sink   --output-dir results
python3 batch_classify.py --ext-funcs build/external_funcs_<binary>.out.txt \
    --mode source --output-dir results

Find dangerous flows (headless Ghidra)

./DF.sh   # wrapper around find_dangerous_flows.py

Export code for each flow

"<ghidra>/support/analyzeHeadless" <workspace> ProjectName \
    -import <binary> -scriptPath . -postScript export_flow_code.py -deleteProject

Inspect flows with an LLM

python3 inspect_flows_with_llm.py \
    --flows-with-code results/flows_with_code_<binary>.json \
    --sources results/source_classification_<binary>.json \
    --output results/vulnerability_reports.json

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
Data		Data
ghidra-workspace		ghidra-workspace
oldcodes		oldcodes
results		results
resultss		resultss
.gitignore		.gitignore
README.md		README.md
TestSimple.py		TestSimple.py
batch_classify.py		batch_classify.py
classifyGemini.py		classifyGemini.py
classifyLocal.py		classifyLocal.py
export_external_funcs.py		export_external_funcs.py
export_external_funcs.sh		export_external_funcs.sh
export_flow_code.py		export_flow_code.py
export_flow_code.sh		export_flow_code.sh
find_dangerous_flows.py		find_dangerous_flows.py
find_dangerous_flows.sh		find_dangerous_flows.sh
inspect_flows_with_llm.py		inspect_flows_with_llm.py
requirement.txt		requirement.txt
run_export_kb_function.sh		run_export_kb_function.sh
run_openLATTE_batch.sh		run_openLATTE_batch.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenLATTE

Repository layout

Requirements

Basic workflow

About

Uh oh!

Releases

Packages

Languages

Hsmnasiri/OpenLATTE

Folders and files

Latest commit

History

Repository files navigation

OpenLATTE

Repository layout

Requirements

Basic workflow

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages