Publications | 岸田拓也 Takuya Kishida

PUBLICATIONS (last update 2023.4.25)

Papers

Kishida, T., Nakashika, T. (2020). Speech chain VC: linking linguistic and acoustic levels via latent distinctive features for RBM-based voice conversion, IEICE TRANSACTIONS on Information and Systems, Vol.E103-D, No.11, pp.2340-2350.
Zhang, Y. 1, Nakajima, Y., Ueda, K., Kishida, T., & Remijn, G. B., (2020). Comparison of Multivariate Analysis Methods as Applied to English Speech, Appl. Sci. 10, 7076; https://doi.org/10.3390/app10207076
Nakajima, Y., Ueda, K., Remijn, G. B., Yamashita, Y., & Kishida, T., (2018). How sonority appears in speech analyses, Acoust. Sci. & Tech., 39, 179-181.
中島祥好，上田和夫，G. B. レメイン，山下友子，岸田拓也, (2018). 音声分析に現れる鳴音性，日本音響学会誌，74号，pp. 93-96.
Kishida, T., Nakajima, Y., Ueda, K., & Remijn, G. B. (2016). Three factors are critical in order to synthesize intelligible noise-vocoded Japanese speech. Frontiers in psychology, 7:517.

Presentations

International

Kishida, T., & Nakashika, T. (2022). Non-parallel voice conversion based on free-energy minimization of speaker-conditional restricted boltzmann machine. In 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) IEEE.
Isako, T., Onishi, K., Kishida, T., & Nakashika, T. (2022). Controllable voice conversion based on quantization of voice factor scores. In 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) IEEE.
Kishida, T., Tsukamoto, S., & Nakashika, T. (2020). Simultaneous Conversion of Speaker Identity and Emotion Based on Multiple-Domain Adaptive RBM. Proc. Interspeech 2020, 3431-3435.
Zhang, Y., Nakajima, Y., Yu, X., Remijn, G. B., Ueda, K., Kishida, T., & Elliott M. A. (2019). Acoustic analysis of word-initial consonant clusters: a perceptual basis of English syllables. The 35th Annual Meeting of the International Society for Psychophysics. Antalya, Turkey, September 2019. (refereed)
Kishida, T., Nakajima, Y., Ueda, K., Remijn, G. B., & Umemoto, S. (2018). Which spectral-change factors are indispensable to make Japanese speech intelligible? The 3rd International Five-Sense Symposium (5-SENSE 2018). Fukuoka, Japan, September 2018.
Kishida, T., Nakajima, Y., Ueda, K., Remijn, G. B., & Umemoto, S. (2018). A spectral-change factor related to sonority makes noise-vocoded Japanese speech intelligible. The 34th Annual Meeting of the International Society for Psychophysics. Lüneburg, Germany, August 2018. (refereed)
Yu, X., Nakajima, Y., Zhang, T., Kishida, T., & Ueda, K. (2018). An acoustic analysis of preposition phrases in English. The 34th Annual Meeting of the International Society for Psychophysics. Lüneburg, Germany, August 2018. (refereed)
Zhang, Y., Yu. X., Nakajima, Y., Ueda, K., Kishida, T., Elliott, M. A., & Arndt, S. (2018). Multivariate acoustic analysis of initial consonant clusters in English. The 34th Annual Meeting of the International Society for Psychophysics. Lüneburg, Germany, August 2018. (refereed)
Kishida, T., Nakajima, Y., Ueda, K., Remijn, G. B., Arndt, S., & Elliott, M. A. (2017). Intelligibility of English noise-vocoded speech resynthesized from spectral-change factors. The 33rd Annual Meeting of the International Society for Psychophysics. Fukuoka, Japan, October 2017. (refereed)
Nakajima, T., Ueda, K., Remijn, G. B., Yamashita, Y., & Kishida, T. (2017). Phonology and psychophysics: Is sonority real? The 33rd Annual Meeting of the International Society for Psychophysics. Fukuoka, Japan, October 2017. (refereed)
Fujii, Y., Nakajima, T., Kishida, T., & Seno, T. (2017). Is the acoustic language universal really universal? The 33rd Annual Meeting of the International Society for Psychophysics. Fukuoka, Japan, October 2017. (refereed)
Tomimatsu, E., Nakajima, Y., Elliott, M. A., Ito, H., & Kishida, T. (2017). The relationship between the filled duration illusion and the time dilation illusion. The 33rd Annual Meeting of the International Society for Psychophysics. Fukuoka, Japan, October 2017. (refereed)
Kishida, T., Nakajima, Y., Ueda, K., & Remijn G. B. (2016). Effects of factor elimination on intelligibility of noise-vocoded Japanese speech. 　 The 31st International Congress of Psychology. Yokohama, Japan, September 2016. (refereed)
Kishida, T., Nakajima, Y., & Nakao, K. (2016). Origin shifted principal component analysis: A method suitable for reconstructing non-negative data. 　 The 31st International Congress of Psychology. Yokohama, Japan, September 2016. (refereed)
Nakao, K., Nakajima, Y., & Kishida, T. (2016). Perceptual validity and analytical advantages of non-negative bases extracted from factor analyses of Japanese speech. The 31st International Congress of Psychology. Yokohama, Japan, September 2016. (refereed)
Kishida, T., Nakajima, Y., Ueda, K., & Remijn G. B. (2015). A critical number of power-fluctuation factors needed for Japanese noise-vocoded speech perception. The 31st Annual Meeting of the International Society for Psychophysics. Quebec, Canada, August 2015. (refereed)
Hasuo, E., Nakajima, Y., Kishida, T., Remijn, G. B., & Ueda, K. (2015). Perceived durations of filled and empty intervals. The 31st Annual Meeting of the International Society for Psychophysics. Quebec, Canada, August 2015. (refereed)
Kishida, T., Nakajima, Y., Ueda, K., & Remijn G. B. (2014). Perceptual roles of power fluctuation factors in speech. The 13th International Conference on Music Perception and Cognition and the 5th Conference for the Asian-Pacific Society for Cognitive Sciences of Music. Seoul, South Korea, August 2014. (refereed)
Hasuo, E., Ueda, K., Kishida, T., Fujihira, H., Morimoto, S., Remijn, G. B., Shiraishi, K., Tobimatsu, S., & Nakajima, Y. (2014). Duration perception of filled and empty intervals —a study with magnitude estimation and electroencephalography. The 13th International Conference on Music Perception and Cognition and the 5th Conference for the Asian-Pacific Society for Cognitive Sciences of Music. Seoul, South Korea, August 2014. (refereed)
Kishida, T., Nakajima, Y., Ueda, K. (2013). Effects of elimination of power-fluctuation factors from critical-band noise-vocoded speech. The 29th Annual Meeting of the International Society for Psychophysics. Freiburg i. Br., Germany, October 2013. (refereed)
Hasuo, E., Nakajima, Y., Kishida, T., Tomimatsu, E., Ueda, K., & Grondin, S. (2013). The filled duration illusion with the method of adjustment when filled vs. empty comparison intervals are used. The 29th Annual Meeting of the International Society for Psychophysics. Freiburg i. Br., Germany, October 2013. (refereed)
Kishida, T., Nakajima, Y., Ueda, K. (2013). Perceptual roles of power-fluctuation factors of speech sound revealed by cepstral liftering and zero-shifted factor analysis. The 22nd Virtual Reality Psychology International Conference. Fukuoka, Japan, October 2013.

Domestic

岸田拓也, 中鹿亘, 「入力特徴量で条件づけた拡散確率モデルによるパラレル声質変換」, 日本音響学会音声研究会, 沖縄, 2023年3月.
羽賀洋克, 矢田部浩平, 岸田拓也, 中鹿亘, 「振幅重み付けエネルギー関数を用いたボルツマンマシンによる位相復元」, 日本音響学会2023年春季研究発表会，3-3-13，pp. 769-770, 2023年3月.
奥田耕平岸田拓也, 中鹿亘, 「Dual Diffusion Implicit Bridgesを用いた話者間の匿名性を担保した声質変換」, 日本音響学会2023年春季研究発表会，1-3Q-10, 2023年3月.
許誠, 岸田拓也, 中鹿亘, 「Speechsplit を用いたイントネーション・リズム・発音の矯正による外国語アクセント変換」, 日本音響学会2023年春季研究発表会，3-3P-11, 2023年3月.
岸田拓也, 中鹿亘, 「条件付き制限ボルツマンマシンの平衡化傾向を利用したノンパラレル声質変換」, 日本音響学会2022年秋季研究発表会，2-Q-48，pp. 1305-1306, 2022年9月.
井硲巧, 大西弘太郎, 岸田拓也, 中鹿亘, 「話者因子係数の量子化に基づく声色制御可能な話者変換」, 日本音響学会2022年秋季研究発表会, 2-Q-47, pp. 1301-1304, 2022年9月.
越森道貴, 嵯峨山茂樹, 岸田拓也, 中鹿亘, 「F0適応ラグ窓を用いた音声分析系の精緻化」，音学シンポジウム2022, 2022年6月
古田翔太郎, 岸田拓也, 中鹿亘, 「制限ボルツマンマシンを用いた独立低ランク行列分析に基づくブラインド音源分離」，音学シンポジウム2022, 2022年6月
平本佳弘, 嵯峨山茂樹, 岸田拓也, 中鹿亘, 「LSP周波数間隔のクロスエントロピー誤差最小化に基づくVAE声質変換」，音学シンポジウム2022, 2022年6月
王庭輝, 岸田拓也, 中鹿亘, 「リズムスタイルを考慮したFader Networksに基づく外国語学習者の発音変換」, 日本音響学会2022年春季研究発表会, 1-3P-14, pp. 1043-1044, 2022年3月
Zhou Yujin, 岸田拓也, 中鹿亘, 「TTSモデルにおけるアラインメントロバスト性向上のための非停滞化制約付きForward Attention」, 日本音響学会2022年春季研究発表会, 2-3P-7, pp. 1113-1114, 2022年3月.
岡留有希, 大西弘太郎, 岸田拓也, 中鹿亘, 「印象表現語ラベルを用いたFaderNetworksに基づく音声印象変換」, 日本音響学会2022年春季研究発表会, 1-3-7, pp. 897-900, 2022年3月.
羽賀洋克, 矢田部浩平, 岸田拓也, 中鹿亘, 「時系列条件付きボルツマンマシンによる位相復元」, 日本音響学会2022年春季研究発表会,2-3-2, pp. 943-946, 2022年3月.
飯田紘崇, 岸田拓也, 中鹿亘, 「マルチモーダルVAEを用いた顔画像に基づく目標話者音声不要な声質変換」, 日本音響学会2022年春季研究発表会, 2-3-3, pp. 947-950, 2022年3月.
岸田拓也, 中鹿亘, 「深層エネルギーベースモデルによる音声の音響特徴量の生成」, 日本音響学会2021年秋季研究発表会, 2-3-1, pp. 901-904, 2021年 9月
井硲巧, 岸田拓也, 中鹿亘, 「話者依存度に応じた特徴抽出器によるdisentagleな声質変換」, 日本音響学会2021年秋季研究発表会, 1-3-14, pp. 887-888, 2021年 9月
井硲巧，岸田拓也，中鹿亘，「話者特徴抽出器を加えたFaderNetVCによる未知話者声質変換」，音学シンポジウム2021, オンライン開催，2021年6 月
岸田拓也，中鹿亘，「Attention RBMによる音声特徴量系列の符号化と生成」，日本音響学会2021年春季研究発表会，オンライン開催, 2021 年3 月
羽賀洋克，矢田部浩平，岸田拓也，中鹿亘，「条件付きボルツマンマシンによる位相復元の初期検討」，日本音響学会2021年春季研究発表会，オンライン開催, 2021 年3 月
岸田拓也，中鹿亘，「Cluster ARBM を用いた話者・音韻相互作用分類による声質変換」，日本音響学会2020年秋季研究発表会，オンライン開催, 2020 年9 月
岸田拓也，中鹿亘，「Speech chain を模倣したボルツマンマシンによるワンショット多対多声質変換の検討」日本音響学会2020年春季研究発表会，さいたま，2020 年3 月
塚本伸，岸田拓也，中鹿亘，「適応型 RBM を用いた音声情報の分離による話者と感情の同時変換」日本音響学会2020年春季研究発表会，さいたま，2020 年3 月
荒川賢也，岸田拓也，中鹿亘，「マルチタスクモデルを用いたdisentangleな学習による楽器音変換」日本音響学会2020年春季研究発表会，さいたま，2020 年3 月
岸田拓也，中鹿亘，「Speech chain VC: 音声コミュニケーションの言語-生理-音響連鎖を考慮する声質変換」，日本音響学会2019年秋季研究発表会，滋賀，2019 年9 月
塚本伸，岸田拓也，中鹿亘，「適応型 RBM を用いたノンパラレル感情音声変換」，日本音響学会2019年秋季研究発表会，滋賀，2019 年9 月
荒川賢也，岸田拓也，中鹿亘，「Fader Networks を用いた楽器音変換」，日本音響学会2019年秋季研究発表会，滋賀，2019 年9 月
岸田拓也，伊藤裕之，「まばたき直後の視覚体験の時間的遡り：視聴覚刺激の時間順序判断課題による測定」，日本視覚学会　2019 年　冬季大会，神奈川，2019 年1 月
于暁陽，中島祥好，張一新，岸田拓也，上田和夫，「英語における前置詞句についての音響分析」，言語資源活用ワークショップ2018，東京，2018年9月
張一新，中島祥好，于暁陽，岸田拓也，上田和夫，「英語における頭子音連結の多変量解析」，言語資源活用ワークショップ2018，東京，2018年9月
岸田拓也，中島祥好，上田和夫，Gerard B. Remijn，梅本晟弥，「パワースペクトル因子を用いて合成した雑音駆動音声の知覚：鳴音性と明瞭度との関係」，日本音響学会聴覚研究会，沖縄，2018年3月
中尾貫志，岸田拓也，中島祥好，「臨界帯域パワー変動に着目した歌唱音声と朗読音声との比較」，日本音響学会聴覚研究会，沖縄，2018年3月
岸田拓也，中島祥好，上田和夫，Gerard B. Remijn，中尾貫志，「臨界帯域パワー変動因子を用いた雑音駆動音声の合成—因子の除去が明瞭度に与える効果—」，日本音響学会聴覚研究会，京都，2017年3月
中尾貫志，岸田拓也，中島祥好，亀岡弘和，「臨界帯域パワー変動から抽出された非負基底の音声再合成における妥当性」，日本音響学会聴覚研究会，京都，2017年3月
岸田拓也，亀岡弘和，中島祥好，「トラジェクトリ隠れマルコフモデルによる音声強調」，日本音響学会2016年秋季研究発表会，富山，2016年9月
岸田拓也，中島祥好，上田和夫，Gerard B. Remijn，「主成分分析によって縮約し，再合成した日本語音声の明瞭度について」，ヒューマン情報処理研究会，福岡，2015年７月
Kishida, T., Nakajima, Y., Ueda, K., & Remijn, G. B., “Perceptual roles of power-fluctuation factors in Japanese speech”, 第48回知覚コロキウム，大分，2015年3月
Kishida, T., Nakajima, Y., Ueda, K., & Remijn, G. B., “Perceptual roles of power-fluctuation factors in speech perception: A new method of factor analysis”, 日本音響学会聴覚研究会，福岡，2014年12月
岸田拓也，中島祥好，上田和夫，「音声の因子分析と再合成～因子の除去が明瞭性に与える効果～」，日本音響学会聴覚研究会，宮城，2013年8月
岸田拓也，中島祥好，上田和夫，竹市博臣，「歌唱音声の多変量解析と再合成」，日本音響学会聴覚研究会，福岡，2012年12月（日本音響学会九州支部学生表彰受賞）
中島祥好，竹市博臣，木寺沙紀，上田和夫，岸田拓也，「朗読音声と歌唱音声との因子分析」，日本基礎心理学会第31回大会，福岡，2012年11月

Competitive Grants

日本学術振興会科学研究費補助金・基盤研究(C)、「深層エネルギーベースモデルによる創造的声質変換の研究」、課題番号：23K11161、研究代表者：岸田拓也　研究分担者：中鹿亘、期間：2023-2025年度、配分額：4,680千円
日本学術振興会科学研究費補助金・基盤研究(C)、「音響的分析と聞き手の心理評価に基づく表情豊かな英語スピーチ力の育成」、課題番号：20K00842、研究代表者：山下友子　研究分担者：飯田朱美、冬野美晴、C.P. Brocklebank、島崎みどり、岸田拓也、中鹿亘、期間：2020-2024年度、配分額：4,550千円
日本学術振興会科学研究費補助金・若手研究、「対面コミュニケーションと同等に感情を伝えるための音声強調処理法の開発」、課題番号：19K20618、研究代表者：岸田拓也、期間：2019-2022年度、配分額：4,160千円
九州大学研究活動基礎支援制度平成３0年度「国際学会派遣支援」，配分額：270千円
日本学術振興会科学研究費補助金・特別研究員奨励費、「音声情報圧縮の基礎研究：言語音のスペクトル変化と発声器官の動きとの同時観測」，課題番号：16J05172，研究代表者：岸田拓也, 期間： 2016–2017年度，配分額：1,900千円

Patents

「音声伝達環境評価システム及び感覚刺激提示装置」、発明者名：中島祥好、上田和夫、Gerard B. Remijn、岸田拓也、小山昭則、田中悠、特許出願人：九州大学、特開 2020-071306、国内、公開年月日: 2020年5月7日
「音声のコーディング・デコーディング方法」、発明者名：松尾行雄、中島祥好、上田和夫、岸田拓也、特許出願人：学校法人東北学院、特開 2018-146652、国内、公開月日: 2018年9月20日
「信号解析装置，方法，及びプログラム」、亀岡弘和、岸田拓也、特許出願人：日本電信電話株式会社、特開 2018-36410、国内、公開年月日: 2018年3月8日

Awards

2013年3月　平成24年度日本音響学会九州支部学生表彰受賞

Takuya Kishida, Ph.D. (Design)

Takuya Kishida, Ph.D. (Design)

PUBLICATIONS (last update 2023.4.25)