Sayaka Shiota

Sayaka Shiota

Contact information

  • Address: 6-6 Asahigaoka, Hino-shi, Tokyo, 191–0065
  • E-Mail: sayaka[at]tmu.ac.jp

Biography

Sayaka Shiota received her B.E., M.E., and Ph.D. degrees in intelligence and computer science, Engineering, and engineering simulation from Nagoya Institute of Technology in 2007, 2009, and 2012, respectively. From February 2013 to March 2014, she worked as a project assistant professor at the Institute of statistical mathematics. In 2014, she joined Tokyo Metropolitan University as an assistant professor and became an associate professor in 2023. Her research interests include statistical speech recognition and speaker verification. She is a member of ASJ, IPSJ, IEICE, APSIPA, ISCA, and IEEE.

  • Professional Experiences
    • Apr. 2023 – Present : Associate Professor of the Department of Information and Communication Systems, Graduate School of System Design, Tokyo Metropolitan University
    • Apr. 2014 – Mar. 2023 : Assistant Professor of the Department of Information and Communication Systems, Graduate School of System Design, Tokyo Metropolitan University
    • Feb. 2013 – Mar. 2014 : Project Assistant Professor of Research Center for Statistical Machine Learning of the Institute of Statistical Mathematics (ISM)
    • Apr. 2012 – Jan. 2013 : Research fellow (PD) of the Japan Society for the Promotion of Science (JSPS)
    • Jun. 2012 – Nov. 2012 : Visiting Researcher at the University of Edinburgh
    • Apr. 2011 – Mar. 2012 : Research fellow (DC2) of the Japan Society for the Promotion of Science (JSPS)
    • Feb. 2010 – Mar. 2011 : Research fellow of the MIC SCOPE project
    • Oct. 2009 – Jan. 2010 : Internal researcher of the National Institute of Information and Communications Technology(NICT)
    • Apr. 2009 – Sep. 2009 : Research fellow of the FP7 EMIME project
  • Educations
    • Apr. 2009 – Mar. 2012 : Nagoya Institute of Technology, Department of Scientific and Engineering Simulation (Ph.D.)
    • Apr. 2007 – Mar. 2009 : Nagoya Institute of Technology, Department of Computer Science and Engineering(Master)
    • Apr. 2003 – Mar. 2007 : Nagoya Institute of Technology, Department of Computer Science
  • Award
    • Jul. 2012 IEICE ISS Young Researcher’s Award in Speech Field
    • Mar. 2012 Vice-president prize (Student Award at Nagoya Institute of Technology)
    • Sep. 2011 Best Student Presentation Award at ASJ 2011 Spring meeting
  • Software

Publications

Journal Paper

  • “A Bayesian framework using multiple model structures for speech recognition,”
    Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda,
    IEICE Transactions on Information and Systems, vol. E96-D, no. 4, pp.939–948, 2013-4.
  • “Speech recognition based on statistical models including multiple phonetic decision trees,”
    Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda,
    Acoust. Sci. & Tech., vol. 32, no. 6, pp.236–243, 2011-11.

    International Conference

  • Interspeech 2012 in Portland, USA (Sep. 9-13 2012)“Cross-lingual Speaker Adaptation for HMM-based Speech Synthesis Using Speaker Interpolation Based on Perceptual Characteristics,”
    Viviane de Franca Oliveira, Sayaka Shiota, Yoshihiko Nankaku, Keiichi Tokuda,
    in Proc. Interspeech 2012, 2012-9.
  • ICASSP 2012 in Kyoto, Japan (Mar. 25-30 2012)“A model structure integration based on a Bayesian framework for speech recognition,”
    Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda, in Proc. ICASSP 2012, pp. 4813-4816, 2012-03.
  • Blizzard Challenge 2010 in Kyoto,Japan (Sep. 25 2010)“Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2010,”
    Keiichiro Oura, Kei Hashimoto, Sayaka Shiota, and Keiichi Tokuda, Blizzard Challenge 2010, 2010-09.
  • SSW7 in Kyoto,Japan (Sep. 22-24 2010)“Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project,”
    Mirjam Wester, John Dines, Matthew Gibson, Hui Liang, Yi-Jian Wu, Lakshmi Saheer, Simon King, Keiichiro Oura, Philip N. Garner, William Byrne, Yong Guan, Teemu Hirsimaki, Reima Karhila, Mikko Kurimo, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Junichi Yamagishi, SSW7, pp.192-197, 2010-09.
  • ACL 2010 in Uppsala, Sweden (July 13 2010)“Personalising speech-to-speech translation in the EMIME project,”
    Mikko Kurimo, William Byrne, John Dines, Philip N. Garner, Matthew Gibson, Yong Guan, Teemu Hirsimaki, Reima Karhila, Simon King, Hui Liang, Keiichiro Oura, Lakshmi Saheer, Matt Shannon, Sayaka Shiota, Jilei Tian, Keiichi Tokuda, Mirjam Wester, Yi-Jian Wu and Junichi Yamagishi, ACL 2010, pp.48-53, 2010-07.
  • Interspeech 2008 in Brighton, UK (Sep. 7-10 2009)“Deterministic Annealing Based Training Algorithm for Bayesian Speech Recognition,”
    Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, in Proc. Interspeech 2009, pp.680–683, 2009-9.
  • Interspeech 2008 in Brisbane, Australia (Sep. 22-26 2008)“Acoustic Modeling Based Model Structure Annealing for Speech Recognition,”
    Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, in Proc. Interspeech, pp.932–935, 2008-9.

    Technical report

  • SP2012 in Kanagawa, Japan (Jun. 14-15 2012)“Perceptual evaluation of synthesized speech reflecting “personaliies”,”
    Minoru Tsuzaki, Keiichi Tokuda, Hisashi Kawai, Yoshinori Shiga, Jinfu Ni, Keiichiro Oura, Sayaka Shiota,
    IEICE Technical Report, 2012-6.
  • SP2011 in Nagoya, Japan (Jun. 23-24 2011)“Bayesian speech recognition based on model structure integration,”
    Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda, IEICE Technical Report, vol. 111, no. 97, pp.11–16, 2011-6. (Young Researcher’s Award)
  • SLP2008 in Tokyo, Japan(Dec. 9-10 2008)“Speech Recognition Based on Statistical Models Including Multiple Decision Trees,”
    Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda,
    IEICE Technical Report, vol.108, no. 338, pp. 221–226, 2008-12.
  • SP2007 in Toyama, Japan (Jun. 26-27 2007)“Acoustic Modeling Based on Model Structure Annealing for Speech Recognition,”
    Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda,
    IEICE Technical Report, vol. 107, no. 165, pp. 67–72, 2007-7.

    Domestic Conference

  • Acoustical Society of Japan(ASJ) 2012 Spring Meeting in Tokyo, Japan (Mar. 13-15 2013)“Cross-lingual speaker adaptation for HMM-based speech synthesis using joint-eigenvoices with a space of perceptual characteristics,”
    Viviane de Franca Olivera, Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda, ASJ2012 spring meeting, pp. 269–270, 2013-3.
  • Acoustical Society of Japan(ASJ) 2012 Spring Meeting in Kanagawa, Japan (Mar. 13-15 2012)“Cross-lingual Speaker Adaptation for HMM-based speech synthesis using speaker interpolation based on perceptual characteristics,”
    Viviane de Franca Olivera, Sayaka Shiota, Yoshihiko Nankaku, Keiichi Tokuda, ASJ2012 spring meeting, pp. 405–406, 2012-3.
  • Acoustical Society of Japan(ASJ) 2011 Spring Meeting in Tokyo, Japan (Mar. 9-11 2011)“Acoustic modeling based model structure annealing for Bayesian speech recognition,”
    Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Keiichi Tokuda, ASJ2011 spring meeting, pp. 21–24, 2011-3. (Student award)
  • Acoustical Society of Japan(ASJ) 2009 Autumn Meeting in Fukushima, Japan (Sep. 15-17 2009)“Training Algorithm Based on Deterministic Annealing for Bayesian Speech Recognition,”
    Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, and Keiichi Tokuda, ASJ2009 autumn meeting, pp. 3–6, 2009-9.
  • Acoustical Society of Japan(ASJ) 2008 Autumn Meeting in Fukuoka, Japan (Sep. 10-12 2008)“Speech recognition based on multiple phonetic decision tree structures,”
    Sayaka Shiota, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, ASJ2008 autumn meeting, pp. 125–126, 2008-3.
  • Acoustical Society of Japan(ASJ) 2007 Autumn Meeting in Yamanashi, Japan (Sep. 19-21 2007)“Acoustic Modeling Based on Model Structure Annealing for Speech Recognition,”
    Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, and Keiichi Tokuda, ASJ2008 autumn meeting, pp. 142–146, 2007-9.

    Thesis

  • Doctoral Dissertation (Feb. 2012)“ACOUSTIC MODELING BASED ON STATISTICAL MODELS USING MULTIPLE MODEL STRUCTURES”
  • Master’s thesis (Feb. 2009)“SPEECH RECOGNITION BASED ON STATISTICAL MODELS INCLUDING MULTIPLE MODEL STRUCTURES”
  • Graduation thesis (Feb. 2007)