Skip to content
View WiktorProsowicz's full-sized avatar

Highlights

  • Pro

Block or report WiktorProsowicz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
WiktorProsowicz/README.md

About me

I am a Data Science student, mainly interested in developing Deep Neural Network-based Text-to-Speech systems. In the moment, I am attending the master's degree course in Computer Science and Intelligent Systems on AGH University of Krakow.


Feel free to check out my publication titled Using Denoising Diffusion Model for Predicting Global Style Tokens in an Expressive Text-to-Speech System that appeared in a special issue of Electronics. It contains results of months of research and work on developing a Deep Learning-based TTS system.

As for the professional experience, I've been able to gain a fair level of proficiency in C++ Software Development while working at Aptiv in Krakow. This gave me an opportunity to gain practical skills in developing object-oriented infrastructures, designing and implementing tests as well as optimizing and debugging the code.


I'm interested in applying advanced techniques for learning languages and currently I'm striving for getting an advanced level of fluency in German.

Primary projects

Name Description Tech stack
ddpm-gst-speech-synthesis A Deep Learning system performing Text-to-Speech synthesis with explicit prosody modeling based on diffusion generative algorithm. The project scopes data preprocessing, model training and evaluation of the system, both objective and human-based. Python, Pytorch
(WIP) paragraph-tts An advanced TTS system for synthesizing long-form speech, e.g. audiobook paragraphs. It scopes preparing complex numerical representations for large amounts of pre-processed data and training a Deep Learning model, leveraging advanced fast attention mechanism for improved length generalization. Python, Pytorch, MLFlow
(WIP) ai_projects A C++ framework that helps developing deep learning models. It contains building blocks for designing neural networks as well as significant algorithms for training it, i.e. back-propagation and tensor operations. C++, Googletest
advanced-data-mining-project Contains results of investigating sentiment analysis possibilities for a scraped dataset containing textual restaurant reviews. The repository contains pipeline for data preprocessing, involving composing advanced numerical representations for text content, and benchmarking multiple rating prediction models. The project includes professional experiment tracking and automated model's evaluation. Python, Pytorch, MLFlow, Transformers
(WIP) comp-int-proj-tts Contains implementation of a Conformer-based TTS model with several prosody modeling methods. The purpose of the project is to benchmark different configurations of the model in order to gain insight into the influence of the prosody modeling techniques on the model's performance. Python, Pytorch, MLFlow
(WIP) rag-app A containerized infrastructure serving an AI Chat that provides support on behalf of a chosen institution. The system spans a demo web application, a RAG-style knowledge base handling and a module responsible for providing content safety guardrails and streaming LLM responses. The infrastructure is designed to support self-hosted models. Python, LangChain, Docker, FastAPI, Gradio, Qdrant

Popular repositories Loading

  1. sound-resurrection sound-resurrection Public

    Project containing work related to the research in sound generation/restoration domain.

    Python 1

  2. ddpm-gst-speech-synthesis ddpm-gst-speech-synthesis Public

    Contains implementation of a TTS system based on Global Style Tokens and Denoising Diffusion Probabilistic Models

    Python 1

  3. OtterCards OtterCards Public

    Mobile app for revsing any kind of knowledge via flashcards.

    Python

  4. WiktorProsowicz WiktorProsowicz Public

    Config files for my GitHub profile.

  5. inz-oprog-projekt inz-oprog-projekt Public

    Group project for university classes.

    PHP 1

  6. ai_projects ai_projects Public

    Set of implementations of chosen algorithms from several fields of artificial intelligence

    C++