Nikita Torgashov

Portrait of Nikita Torgashov

I am a first-year PhD student at TMH, KTH Royal Institute of Technology, advised by Prof. Gustav Eje Henter and Prof. Gabriel Skantze. My PhD is funded by WASP.
I received my Master's degree in Applied Math and Computer Science from ITMO University. Before my PhD, I worked on the voice team at ID R&D Inc. as a Senior Machine Learning Engineer for five years, focusing on speaker recognition and voice anti-spoofing.

Research

I'm interested in conversational AI and speech synthesis, with a current focus on streaming text-to-speech (TTS).

VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency

Nikita Torgashov, Gustav Eje Henter, Gabriel Skantze

Submitted to ICASSP 2026 (preprint)

We present VoXtream, a fully autoregressive, zero-shot streaming text-to-speech (TTS) system for real-time use that begins speaking from the first word...

VoXtream project figure

Reshape Dimensions Network for Speaker Recognition

Ivan Yakovlev, Rostislav Makarov, Andrei Balykin, Pavel Malov, Anton Okhotnikov, Nikita Torgashov

Interspeech 2024

In this paper, we present Reshape Dimensions Network (ReDimNet), a novel neural network architecture for extracting utterance-level speaker representations...

ReDimNet project figure

VoxTube: a Multilingual Speaker Recognition Dataset

Ivan Yakovlev, Anton Okhotnikov, Nikita Torgashov, Rostislav Makarov, Yuri Voevodin, Konstantin Simonchik

Interspeech 2023

The objective of this paper is to advance the development of technologies in the fields of speaker recognition and speaker identification by introducing a large...

VoxTube project figure

A Subnetwork Approach for Spoofing Aware Speaker Verification

Alexander Alenin, Nikita Torgashov, Anton Okhotnikov, Rostislav Makarov, Ivan Yakovlev

Interspeech 2022

This paper describes the ID R&D team submission to the Spoofing Aware Speaker Verification (SASV) challenge. Firstly, we present...

SASV project figure

LRPD: Large Replay Parallel Dataset

Ivan Yakovlev, Mikhail Melnikov, Nikita Bukhal, Rostislav Makarov, Alexander Alenin, Nikita Torgashov, Anton Okhotnikov

ICASSP 2022

The latest research in the field of voice anti-spoofing (VAS) shows that deep neural networks (DNN) outperform classic approaches like GMM in the task of...

LRPD project figure

Competitions

  • VoxCeleb Speaker Recognition Challenge
    • 1st place in the open track in 2022 and 2023
  • Spoofing-Aware Speaker Verification Challenge
  • Short-duration Speaker Verification Challenge
    • 1st place in the text-independent track
  • Kaggle
    • Top 3% in the BirdCLEF 2023 competition
    • Top 4% in the Rainforest Connection Species Audio Detection challenge