Nikita Torgashov

I am a second-year PhD student at TMH, KTH Royal Institute of Technology, advised by Prof. Gustav Eje Henter and Prof. Gabriel Skantze. My PhD is funded by WASP.
I received my Master's degree in Applied Math and Computer Science from ITMO University. Before my PhD, I worked in voice team in IDR&D Inc. as a Senior Machine Learning Engineer for five years, with a focus on speaker recognition and voice anti-spoofing.

Research

I'm interested in conversational AI and speech synthesis with a current focus on streaming TTS.

VoXtream2: Full-stream TTS with dynamic speaking rate control

Nikita Torgashov, Gustav Eje Henter, Gabriel Skantze

Submitted to Interspeech 2026 (Long paper)

Full-stream text-to-speech (TTS) for interactive systems must start speaking with minimal delay while remaining controllable as text arrives incrementally.... [Read more]

VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency

Nikita Torgashov, Gustav Eje Henter, Gabriel Skantze

Accepted to ICASSP 2026 (Oral)

We present VoXtream, a fully autoregressive, zero-shot streaming text-to-speech (TTS) system for real-time use that begins speaking from the first word... [Read more]

Reshape Dimensions Network for Speaker Recognition

Ivan Yakovlev, Rostislav Makarov, Andrei Balykin, Pavel Malov, Anton Okhotnikov, Nikita Torgashov

Interspeech 2024

In this paper, we present Reshape Dimensions Network (ReDimNet), a novel neural network architecture for extracting utterance-level speaker representations... [Read more]

VoxTube: a Multilingual Speaker Recognition Dataset

Ivan Yakovlev, Anton Okhotnikov, Nikita Torgashov, Rostislav Makarov, Yuri Voevodin, Konstantin Simonchik

Interspeech 2023

The objective of this paper is to advance the development of technologies in the fields of speaker recognition and speaker identification by introducing a large... [Read more]

A Subnetwork Approach for Spoofing Aware Speaker Verification

Alexander Alenin, Nikita Torgashov, Anton Okhotnikov, Rostislav Makarov, Ivan Yakovlev

Interspeech 2022

This paper describes the ID R&D team submission to the Spoofing Aware Speaker Verification (SASV) challenge. Firstly, we present... [Read more]

LRPD: Large Replay Parallel Dataset

Ivan Yakovlev, Mikhail Melnikov, Nikita Bukhal, Rostislav Makarov, Alexander Alenin, Nikita Torgashov, Anton Okhotnikov

ICASSP 2022

The latest research in the field of voice anti-spoofing (VAS) shows that deep neural networks (DNN) outperform classic approaches like GMM in the task of... [Read more]

Services

Reviewer
- ICASSP 2026
- Odyssey 2024: The Speaker and Language Recognition Workshop

Competitions

VoxCeleb Speaker Recognition Challenge
- 1st place in open track in 2023 and 2022
Spoofing-Aware Speaker Verification Challenge
- 1st place
Short-duration Speaker Verification Challenge
- 1st place in text-independent track
Kaggle
- Top 3% at BirdCLEF 2023 competition
- Top 4% at Rainforest Connection Species Audio Detection challenge