|
1 | 1 | # MAGmax |
2 | 2 | MAGmax is a dereplication tool designed to maximize the recovery of Metagenome-Assembled Genomes (MAGs) through bin Merging and reAssembly. It performs dereplication in three stages: (i) grouping bins based on average sequence identity, (ii) merging bins within each group, and (iii) reassembling the merged bins. |
3 | 3 |
|
4 | | -## INPUTS |
| 4 | +## Inputs |
5 | 5 | MAGmax requires three input directories, |
6 | 6 | 1. `binsdir`, directory containing bin files in FASTA format that need to be dereplicated. (e.g., output files from any metagenome binning tool) |
7 | 7 |
|
8 | 8 | 2. `readdir`, directory containing read files in FASTQ format for each sample. |
9 | 9 |
|
10 | 10 | 3. `mapid_dir`, directory containing mapping files for each sample. Each file is a text file listing read IDs and the corresponding contig IDs they mapped to. These files are used to retrieve reads that map to each merged bin from the FASTQ files in `readdir` and to generate new bin-specific FASTQ files for reassembly. |
11 | 11 |
|
12 | | -## OUTPUT |
| 12 | +## Output |
13 | 13 | An output directory named `mags_<x>comp_<y>purity` will be created, where `x` and `y` correspond to the user-specified completeness and purity thresholds used to select final bins. By default, MAGmax uses a percentage of 50 for completeness and 95 for purity. |
14 | 14 | The output directory contains dereplicated bins, and a text file listing the completeness and contamination scores for each bin as calculated by CheckM2. |
15 | 15 |
|
16 | | -### Example command line call |
| 16 | +## Example command line call |
17 | 17 |
|
18 | 18 | magmax -b <binsdir> -r <readdir> -m <mapid_dir> -f fasta -t 24 |
19 | 19 | magmax -b <binsdir> -r <readdir> -m <mapid_dir> -f fasta -t 24 -q quality_report.tsv // if CheckM2 result is already available |
20 | 20 | magmax -b <binsdir> -r <readdir> -m <mapid_dir> -f fasta -t 24 --split // if input bins are not already split by sample id |
21 | 21 |
|
22 | | -## Install |
| 22 | +## Installation |
23 | 23 | ### Prerequisites |
24 | 24 |
|
25 | 25 | - **Rust**: Follow the instructions [here](https://www.rust-lang.org/tools/install) to install Rust. |
@@ -87,7 +87,7 @@ Option 2: Build from source |
87 | 87 | -V, --version |
88 | 88 | Print version |
89 | 89 |
|
90 | | -### Test run using toy data |
| 90 | +## Test run using toy data |
91 | 91 | This example test run demonstrates dereplication of bins using the provided toy dataset. In the `test/bins` directory, example bins generated with MetaBAT2 are given. In the `test/reads` directory, paired-end read files for two samples are given and in the `test/mapids` directory, mapid files mapping reads to contigs for each sample are given. Precomputed CheckM2 quality scores for the input bins are given in the `test/quality_report.tsv`. Run the following command to execute the test: |
92 | 92 |
|
93 | 93 | magmax -b test/bins -r test/reads -m test/mapids -t 24 -q test/quality_report.tsv |
|
0 commit comments