Kyoto University, School of Informatics, Professor
From Apr. 2006, To Mar. 2020
National Institute of Information and Communications Technology (NICT), visiting researcher
From Apr. 2003, To Mar. 2016
Kyoto University, Academi Center for Computing & Media Studies, Professor
From Sep. 1998, To Mar. 2006
Advanced Telecommunications Research Institute International, 客員研究員
From Oct. 1999, To Mar. 2004
National Institute for Japanese Language and Linguistics, 非常勤研究員
From Apr. 1998, To Mar. 2003
Kyoto University, School of Informatics, Associate Professor
From Jun. 1995, To Apr. 1998
Kyoto University, Faculty of Engineering, Associate Professor
From Sep. 1995, To Aug. 1996
Bell Laboratories, Visiting Researcher
From Nov. 1990, To May 1995
Kyoto University, Faculty of Engineering, Research Associate
Profile
Profile
Tatsuya Kawahara received B.E. in 1987, M.E. in 1989, and Ph.D. in 1995, all in information science, from Kyoto University, Kyoto, Japan. From 1995 to 1996, he was a Visiting Researcher at Bell Laboratories, Murray Hill, NJ, USA. Currently, he is a Professor in the School of Informatics, Kyoto University. He has also been an Invited Researcher at ATR and NICT.
Dr. Kawahara is a board member of APSIPA and ISCA, and a Fellow of IEEE.
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference(APSIPA), 2020
Designing Precise and Robust Dialogue Response Evaluators.
Tianyu Zhao; Divesh Lala; Tatsuya Kawahara
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics(ACL), 2020
Fast Multichannel Nonnegative Matrix Factorization With Directivity-Aware Jointly-Diagonalizable Spatial Covariance Matrices for Blind Source Separation.
Graham Neubig; Taro Watanabe; Shinsuke Mori; Tatsuya Kawahara
Machine Translation, Jun. 2013, Peer-reviewed
Multi-party human-machine interaction using a smart multimodal digital signage
Tony Tung; Randy Gomez; Tatsuya Kawahara; Takashi Matsuyama
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2013, Peer-reviewed
Estimation of Interest and Comprehension Level of Audience through Multi-modal Behaviors in Poster Conversations
Admissible stopping in Viterbi beam search for unit selection speech synthesis
S.Sakai; T.Kawahara
IEICE Trans., 2013, Peer-reviewed
Combining Active Learning and Partial Annotation for Japanese Dependency Parsing
Daniel Flannery; 宮尾祐介; 森 信介; 河原 達也
言語処理学会年次大会発表論文集, 2013
オープンコースウェアの講演を対象とした音声認識に基づく字幕付与
秋田 祐哉; 河原 達也
日本音響学会研究発表会講演論文集, 2013
CSJを用いた日本語講演音声認識用DNN-HMMの構築
三村 正人; 河原 達也
日本音響学会研究発表会講演論文集, 2013
Automatic transcription of Chinese spoken lectures
Sheng Li; Masato Mimura; Tatsuya Kawahara
日本音響学会研究発表会講演論文集, 2013
音声認識を用いたオンライン自動字幕作成・編集システム
秋田 祐哉; 河原 達也
日本音響学会研究発表会講演論文集, 2013
[招待講演] 音声対話システムの進化と淘汰
河原 達也
人工知能学会研究会資料, 2013, Invited
[特別講演] スマートポスターボード: ポスター発表における場のマルチモーダルなセンシングと認識
河原 達也
電子情報通信学会技術研究報告, 2013, Invited
音声認識の方法論に関する考察―歴史的変遷と今後の展望―
河原 達也
情報処理学会研究報告, 2013, Invited
述語項構造を介したWebテキストからの文選択に基づく言語モデルの評価
吉野 幸一郎; 森 信介; 河原 達也
情報処理学会研究報告, 2013
CSJを用いた日本語講演音声認識へのDNN-HMMの適用と話者適応の検討
三村 正人; 河原 達也
情報処理学会研究報告, 2013
ポスター会話における聴衆のマルチモーダルな振る舞いに基づく 興味・理解度の推定
河原 達也; 林 宗一郎; 高梨 克也
情報処理学会研究報告, 2013
A monotonic statistical machine translation approach to speaking style transformation
Graham Neubig; Yuya Akita; Shinsuke Mori; Tatsuya Kawahara
COMPUTER SPEECH AND LANGUAGE, Oct. 2012, Peer-reviewed
Captioning for Lectures Using Automatic Speech Recognition Technology
Tatsuya Kawahara
Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 01 Sep. 2012, Peer-reviewed
Group dynamics and multimodal interaction modeling using a smart digital signage
Tony Tung; Randy Gomez; Tatsuya Kawahara; Takashi Matsuyama
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2012, Peer-reviewed
Multi-party Human-Robot Interaction with Distant-Talking Speech Recognition
Randy Gomez; Tatsuya Kawahara; Keisuke Nakamura; Kazuhiro Nakadai
HRI'12: PROCEEDINGS OF THE SEVENTH ANNUAL ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2012, Peer-reviewed
Designing an Evaluation Framework for Spoken Term Detection and Spoken Document Retrieval at the NTCIR-9 SpokenDoc Task
Speech Recognizer-based Optimization for Dereverberation Technique Using Multi-band Spectral Subtraction
Randy Gomez; Tatsuya Kawahara
日本音響学会研究発表会講演論文集, 2008
An Application of Online VB-EM Algorithm to Voice Activity Detection
David Cournapeau; Tatsuya Kawahara; Shinji Watanabe; Atsushi Nakamura
日本音響学会研究発表会講演論文集, 2008
Optimizing Scoring System for a Japanese Tutor System
Hongcui Wang; Tatsuya Kawahara
日本音響学会研究発表会講演論文集, 2008
ポスター会話における聞き手反応のマルチモーダルな分析
瀬戸口 久雄; 高梨 克也; 河原 達也
人工知能学会研究会資料, 2008
話し言葉の整形作業における削除箇所の自動同定
尾嶋 憲治; 河原 達也; 秋田 祐哉; 内元 清貴
情報処理学会研究報告, 2008
テキストと音声を用いた単語と読みの自動獲得
笹田 鉄郎; 森 信介; 河原 達也
情報処理学会研究報告, 2008
大学講義のノートテイク支援のための音声認識用言語モデルの適応
勝丸 徳浩; 秋田 祐哉; 森 信介; 河原 達也
情報処理学会研究報告, 2008
同時通訳者の知識と韻律情報を用いた講演文章のチャンキング
清水 徹; 中村 哲; 河原 達也
情報処理学会研究報告, 2008
Robust Speech Recognition in Reverberant Environment by Optimizing Multi-band Spectral Subtraction
Randy Gomez; Tatsuya Kawahara
人工知能学会研究会資料, 2008
ポスター会話におけるあいづちの形態的・韻律的な特徴分析と 会話モード間との相関の分析
常 志強; 高梨 克也; 河原 達也
人工知能学会研究会資料, 2008
会議録作成支援のための国会審議の音声認識システム
秋田 祐哉; 三村 正人; 河原 達也
電子情報通信学会技術研究報告, 2008
音声翻訳単位の推定における句読点情報の効果
清水 徹; 中村 哲; 河原 達也
電子情報通信学会技術研究報告, 2008
Evaluation of Real-time Voice Activity Detection based on High Order Statistics
David Cournapeau; Tatsuya Kawahara
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, Peer-reviewed
PLSA-based Topic Detection in Meetings for Adaptation of Lexicon and Language Model
Yuya Akita; Yusuke Nemoto; Tatsuya Kawahara
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, Peer-reviewed
Bayes Risk-based Optimization of Dialogue Management for Document Retrieval System with Speech Interface
Teruhisa Misu; Tatsuya Kawahara
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, Peer-reviewed
Gaussian Mixture Optimization for HMM based on Efficient Cross-validation
Takahiro Shinozaki; Tatsuya Kawahara
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, Peer-reviewed
音声認識技術の現状と会議録作成への適用可能性 (1)
河原 達也
日本の速記, 2007, Invited
音声認識技術の現状と会議録作成への適用可能性 (2)
河原 達也
日本の速記, 2007, Invited
Intelligent transcription system based on spontaneous speech processing
Tatsuya Kawahara
ICKS 2007: SECOND INTERNATIONAL CONFERENCE ON INFORMATICS RESEARCH FOR DEVELOPMENT OF KNOWLEDGE SOCIETY INFRASTRUCTURE, PROCEEDINGS, 2007, Peer-reviewed
Topic-Independent Speaking-style Transformation of Language Model for Spontaneous Speech Recognition
Y.Akita; T.Kawahara
Proc. IEEE-ICASSP, 2007, Peer-reviewed
Automatic Detection of Sentence and Clause Units using Local Syntactic Dependency
T.Kawahara; M.Saikou; K.Takanashi
Proc. IEEE-ICASSP, 2007, Peer-reviewed
Speech-based Interactive Information Guidance System Using Question-Answering Technique
T.Misu; T.Kawahara
Proc. IEEE-ICASSP, 2007, Peer-reviewed
An Interactive Framework for Document Retrieval and Presentation with Question-Answering Function in Restricted Domain
T.Misu; T.Kawahara
Proc. Int'l Conf. Industrial Engineering \& Other Applications of Artificial Intelligent Systems (IEA/AIE) (LNAI 4570), 2007, Peer-reviewed
Analyzing Temporal Transition of Real User's Behaviors in a Spoken Dialogue System
K.Komatani; T.Kawahara; H.G.Okuno
Proc. INTERSPEECH, 2007, Peer-reviewed
PLSA-based Topic Detection in Meetings for Adaptation of Lexicon and Language Model
Y.Akita; Y.Nemoto; T.Kawahara
Proc. INTERSPEECH, 2007, Peer-reviewed
Gaussian Mixture Optimization for HMM based on Efficient Cross-validation
T.Shinozaki; T.Kawahara
Proc. INTERSPEECH, 2007, Peer-reviewed
Evaluating and Optimizing Japanese Tutor System Featuring Dynamic Question Generation and Interactive Guidance
A Study of Effective Utilization for Agricultural Household Survey by Former Snow Area Branch Office of National Research Institute of Agricultural Economics
Automatic Transformation of GDA Document Tag and Development of Its Applications
Grant-in-Aid for Scientific Research (B)
KYOTO UNIVERSITY
Hiroshi OKUNO
Project Closed
文書タグ;Global Data Annotation(GDA);MPEG-7;意味構造記述方式;会議録インデキシング;Linguistic Description Scheme;MPEG-7音楽記述子;プライバシー重視のアクセス機構;Global Data Annotation (GDA);Global Data Anotation(GDA);意味的情報検索;音声会議録ディジタルアーカイブ;話者インデキシング;プライバシー重視アクセス機構;匿名アクセス機構;要約生成;情報検索;意味構造記述, Document Tags;Global Data.Annotation (GDA);MPEG-7;Semantic Description Scheme;Lecture Minute Indexing;Linguistic Description Scheme;MPEG-7 Music Descriptor;Privacy-enhanced Access Control
Musical Information Processing by using Sound Ontology
Grant-in-Aid for Scientific Research (B)
KYOTO UNIVERSITY;Tokyo University of Science
Hiroshi OKUNO
Project Closed
音オントロジー;楽器音の音源同定;F0依存多次元正規分布;両耳間位相差による音源定位;楽器の階層的認識;教師なし学習;クラスタリング;決定木学習;FO依存多次元正規分布;楽器の断層的認識;楽音認識;音高依存関数;音源定位;対判別関数;階層的認識;環境音認識;楽音特徴抽出;音高によるテンプレート切替, Sound Ontology;Musical Instrument Identification;F0-dependent Multivariate Normal Distribution;Localization by Intramural Phase Difference;Hierarchical Recognition of Musical Instruments;Unsupervised Learning;Clustering;Decision Tree Learning
An Automatic Lecture Recording System with Recognizing the Situation of the Lecture
Grant-in-Aid for Scientific Research (B)
KYOTO UNIVERSITY
Katsuo IKEDA
Project Closed
講義の自動記録;教材データベース;動画像処理;実時間処理;複数情報の統合;講義映像の分割;講師の追跡;講義状況の認識;音声認識;インデックス付加;指示動作認識;説明箇所の推定;自動映像切替え;教材データベースの構築;音声処理;複数の情報の統合;話題への分割;カメラマンロボット;黒板記録, Automatic Lecture Recording;Lecture Video Database;Video Data Processing;Real-time Image Processing;Integration of Multiple Information;Division of Lecture Video;Lecturer Tracking;Recognition of Lecture Situation
Research on Understanding and Generating Dialogue by Integrated Processing of Speech, Language and Concept
Grant-in-Aid for Scientific Research on Priority Areas
KYOTO UNIVERSITY
Shuji DOSHITA
Project Closed
音声認識;自然言語処理;対話理解;概念処理;音声対話;音声処理;概念・知識処理;対話モデル;論理・パターンの融合;認知モデル;対話コーパス;論理・パターンの統合, Speech Recognition;Natural Language Understanding;Dialogue Understanding;Conceptual Information Processing;Spoken Dialogue
Robust speech understanding against inter-speaker variation and ungrammatical utterances based on high-accuracy speech recognition and semantic driven parsing method
Grant-in-Aid for General Scientific Research (B)
KYOTO UNIVERSITY
Shuji DOSHITA
Project Closed
音声認識;自然言語理解;意味解析;話者変動;ロバストパ-サ;ロバストパーサ, Speech Recognition;Natural Language Understanding;Semantic Analysis;Inter-speaker Variation;Robust Parser