🎓 Master's student in Bioinformatics at Alma Mater Studiorum – Università di Bologna
🔬 Focused on machine learning for biological sequences, protein language models, and computational biology
📍 Modena, Italy
I am an MSc Bioinformatics student at the University of Bologna, focusing on protein sequence modelling, protein language models, and ML-based sequence-function prediction.
My current work includes ESM-2-based protein sequence classification, HMMER/MMseqs2 workflows for protein domain annotation, and computational biology pipelines. I am especially interested in applying machine learning to protein function prediction, protein representation learning, and generative protein design.
Languages
ML & Data Science
Bioinformatics
Tools
Protein sequence-function prediction pipeline for eukaryotic signal peptide detection using UniProtKB/Swiss-Prot annotations, MMseqs2 redundancy reduction, classical biological baselines, SVMs with biochemical features, and CNN-BiLSTM models with ESM-2 protein language model embeddings.
Python · PyTorch · ESM-2 · MMseqs2 · Protein Language Models · Bioinformatics
Protein domain annotation workflow using profile Hidden Markov Models, HMMER, and MMseqs2 to compare sequence-based and structure-informed approaches for Kunitz domain detection. The structure-based HMM achieved a peak Matthews correlation coefficient (MCC) of 0.997.
Python · HMMER · MMseqs2 · Protein Domains · Structural Bioinformatics
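For readers unfamiliar with the MCC reported for the Kunitz domain HMM, here is a minimal sketch of how the metric is computed from a binary confusion matrix. The function name `mcc_from_counts` and the example counts are hypothetical and chosen for illustration only; they are not taken from the project or its results.

```python
import math


def mcc_from_counts(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from binary confusion-matrix counts.

    Returns a value in [-1, 1]; 1 is perfect agreement, 0 is chance level.
    By a common convention, 0.0 is returned when the denominator is zero.
    """
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        return 0.0
    return (tp * tn - fp * fn) / denominator


if __name__ == "__main__":
    # Hypothetical counts for a domain detector (not the project's data).
    print(mcc_from_counts(tp=95, tn=900, fp=5, fn=0))
```

Because MCC uses all four cells of the confusion matrix, it stays informative on imbalanced benchmarks where domain-containing sequences are far rarer than negatives.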
Differential DNA methylation analysis of CpG sites between healthy and diseased individuals using Illumina HumanMethylation450 (450k) array data, covering preprocessing, normalization, PCA-based quality assessment, and identification of differentially methylated CpG sites.
R · Bioconductor · minfi · Epigenomics
- 📧 mahan.balooei@studio.unibo.it
- 🔬 ORCID: 0009-0006-5358-0784
- 🏛️ Alma Mater Studiorum – Università di Bologna

