RNA-seq Analysis Pipeline

Author: Sagar Tank
Email: tank.sag@northeastern.edu

Overview

Single-end RNA-seq analysis pipeline using FastQC, Trimmomatic, HISAT2, and featureCounts.

Workflow

FASTQ → FastQC → Trimmomatic → HISAT2 → featureCounts

Tools Used

FastQC v0.12.1 - Quality control
Trimmomatic v0.40 - Adapter and quality trimming
HISAT2 v2.2.1 - Read alignment
featureCounts v2.1.1 - Gene quantification

Requirements

macOS (tested on M1 Pro)
Conda/Miniforge
Reference genome: GRCh38
GTF annotation: Ensembl 106

Installation

# Install tools via conda
conda install -c bioconda fastqc trimmomatic hisat2 samtools subread

# Download reference genome
curl -O https://genome-idx.s3.amazonaws.com/hisat/grch38_genome.tar.gz
tar -xzf grch38_genome.tar.gz

# Download GTF annotation
curl -O http://ftp.ensembl.org/pub/release-106/gtf/homo_sapiens/Homo_sapiens.GRCh38.106.gtf.gz
gunzip Homo_sapiens.GRCh38.106.gtf.gz

Usage

# Place your FASTQ file in data/fastq/
# Run the pipeline
bash rnaseq_pipeline.sh

Results

FastQC reports: data/*_fastqc.html
Trimmed reads: data/demo_trimmed.fastq
Alignment: HISAT2/demo_trimmed.bam
Gene counts: quants/demo_featurecounts.txt

Example Results

From demo dataset (1,250,000 reads):

Trimming: 94.74% reads retained
Alignment: 93.98% mapping rate
Gene assignment: 62.3% (843,012 reads)

Project Structure

rnaseq-pipeline/
├── rnaseq_pipeline.sh    # Main pipeline script
├── data/                 # FastQC reports
├── HISAT2/              # BAM alignment files
└── quants/              # Gene count tables

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
quants		quants
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
rnaseq_pipeline.sh		rnaseq_pipeline.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RNA-seq Analysis Pipeline

Overview

Workflow

Tools Used

Requirements

Installation

Usage

Results

Example Results

Project Structure

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RNA-seq Analysis Pipeline

Overview

Workflow

Tools Used

Requirements

Installation

Usage

Results

Example Results

Project Structure

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages