
I am a first-year PhD student at TMH, KTH Royal Institute of Technology, advised by Prof. Gustav Eje Henter and Prof. Gabriel Skantze.
My PhD is funded by WASP.
I received my Master's degree in Applied Math and Computer Science from ITMO University.
Before my PhD, I worked in voice team in IDR&D Inc. as a Senior Machine Learning Engineer for five years, with a focus on speaker recognition and voice anti-spoofing.
Research
I'm interested in conversational AI and speech synthesis with a current focus on streaming TTS.
VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency
Submitted to ICASSP 2026 Preprint
We present VoXtream, a fully autoregressive, zero-shot streaming text-to-speech (TTS) system for real-time use that begins speaking from the first word... [Read more]

Reshape Dimensions Network for Speaker Recognition
Interspeech 2024
In this paper, we present Reshape Dimensions Network (ReDimNet), a novel neural network architecture for extracting utterance-level speaker representations... [Read more]

VoxTube: a Multilingual Speaker Recognition Dataset
Interspeech 2023
The objective of this paper is to advance the development of technologies in the fields of speaker recognition and speaker identification by introducing a large... [Read more]

A Subnetwork Approach for Spoofing Aware Speaker Verification
Interspeech 2022
This paper describes the ID R&D team submission to the Spoofing Aware Speaker Verification (SASV) challenge. Firstly, we present... [Read more]

Competitions
- VoxCeleb Speaker Recognition Challenge
- Spoofing-Aware Speaker Verification Challenge
-
Short-duration Speaker Verification Challenge
- 1st place in text-independent track
-
Kaggle
- Top 3% at BirdCLEF 2023 competition
- Top 4% at Rainforest Connection Species Audio Detection challenge