I am Sakae Mizuki (水木栄), Ph.D. graduate in Computer Science (Natural Language Processing) from Institute of Science Tokyo (formerly Tokyo Institute of Technology), where I had the honor of being supervised by Prof. Naoaki Okazaki. Having completed my doctoral studies, I now continue my career in the R&D division at Hotto Link, Inc.

In parallel to my full-time work, I am also a part-time researcher at Institute of Science Tokyo and visiting researcher at National Institute of Advanced Industrial Science and Technology. My research interests focused on representation learning, lexical semantics, with a specific emphasis on integrating lexical knowledge into large language models.

Github: s-mizuki-nlp

Email: sakae.mizuki [aatt] nlp [dot] c.titech.ac.jp

Publications

Journal

Sakae Mizuki and Naoaki Okazaki. Learning Hierarchical Code Representation for Hypernymy Detection. Journal of Information Processing (TOD), 14(4):8–23, 2021. Paper

International Conference

Naoaki Okazaki, Kakeru Hattori, Hirai Shota, Hiroki Iida, Masanari Ohi, Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Rio Yokota, Sakae Mizuki. Building a Large Japanese Web Corpus for Large Language Models. In Proceedings of the First Conference on Language Modeling (COLM 2024), pp. (to appear), October 2024. Paper
Kazuki Fujii, Taishi Nakamura, Mengsay Loem, Hiroki Iida, Masanari Ohi, Kakeru Hattori, Hirai Shota, Sakae Mizuki, Rio Yokota, Naoaki Okazaki. Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities. Proceedings of the First Conference on Language Modeling (COLM 2024), pp. (to appear), October 2024. Paper
Shogo Matsuno, Sakae Mizuki, and Takeshi Sakaki. Construction of Evaluation Datasets for Trend Forecasting Studies. In Proceedings of the 17th International AAAI Conference on Web and Social Media (ICWSM 2023), pp. 1041-1051, June 2023. Paper, Dataset
Sakae Mizuki and Naoaki Okazaki. Semantic Specialization for Knowledge-based Word Sense Disambiguation. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2023), pp. 3457-3470, May 2023. Paper, Code
Sakae Mizuki and Naoaki Okazaki. Analyzing the Variation Property of Contextualized Word Representations. In AI 2019: Advances in Artificial Intelligence, pp. 393–405, December 2019. Paper

Domestic Conference

Kakeru Hattori, Naoaki Okazaki, Sakae Mizuki, …, and Hiroya Takamura. Swallow Corpus v2: Educational Japanese Web Text Corpus (Swallowコーパスv2: 教育的な日本語ウェブコーパスの構築). The 31st Annual Meeting of The Association for Natural Language Processing (NLP2025), March 2025. (in Japanese) Paper
Youmi Ma, Sakae Mizuki, Kazuki Fujii, …, and Naoaki Okazaki. Instruction Tuning of Large Language Models through Model Imitation (模倣学習による大規模言語モデルの指示チューニング). The 31st Annual Meeting of The Association for Natural Language Processing (NLP2025), March 2025. (in Japanese) Paper
Kakeru Hattori, Sakae Mizuki, Kazuki Fujii, …, and Naoaki Okazaki. Developing a Japanese LLM Enhanced for Current Affairs and Society by Leveraging Newspaper Articles (新聞記事からつくる時事と社会に強い日本語LLM). The 31st Annual Meeting of The Association for Natural Language Processing (NLP2025), March 2025. (in Japanese) Paper
Koshiro Saito, Sakae Mizuki, Masanari Ohi, …, and Naoaki Okazaki. Advantages of Training LLMs on Japanese Text (LLMに日本語テキストを学習させる意義). The 261st Special Interest Group on Natural Language Processing (SIG-NL), September 2024. (in Japanese) Paper, Best Paper Award
Kazuki Fujii, Taishi Nakamura, Mengsay Loem, …, Sakae Mizuki, Rio Yokota, and Naoaki Okazaki. Developing High-Performance Japanese Large-scale Language Models Using Continual Pre-training (継続事前学習による日本語に強い大規模言語モデルの構築). The 30th Annual Meeting of The Association for Natural Language Processing (NLP2024), March 2024. (in Japanese) Paper, Best Paper Award
Sakae Mizuki, Hiroki Iida, Kazuki Fujii, …, and Naoaki Okazaki. Efficient Japanese Language Capabilities Enhancement in Large-scale Language Models: Utilizing Vocabulary Expansion and Parallel Corpus in Continual Pre-training (大規模言語モデルの日本語能力の効率的な強化: 継続事前学習における語彙拡張と対訳コーパスの活用). The 30th Annual Meeting of The Association for Natural Language Processing (NLP2024), March 2024. (in Japanese) Paper
Naoaki Okazaki, Kakeru Hattori, Shota Hirai, …, and Sakae Mizuki. Swallow Corpus: Large-scale Japanese Web Text Corpus (Swallowコーパス: 日本語大規模ウェブコーパス). The 30th Annual Meeting of The Association for Natural Language Processing (NLP2024), March 2024. (in Japanese) Paper, Best Paper Award
Sakae Mizuki and Naoaki Okazaki. Semantic Specialization for Knowledge-based Word Sense Disambiguation. 29th Annual Meeting of The Association for Natural Language Processing (NLP2023), March 2023. (in Japanese) Paper, Best Paper Award
Ryogo Ishikawa, Ayana Niwa, Sakae Mizuki, Naoaki Okazaki. Robust Dependency Parsing for the Omission of a Post-positional Particle using Pseudo Training Data. The 28th Annual Meeting of the Association for Natural Language Processing (NLP2022), March 2022. (in Japanese) Paper
Sakae Mizuki and Naoaki Okazaki. Hyponymy Detection using Hierarchical Code Learning. 27th Annual Meeting of The Association for Natural Language Processing (NLP2021), pp. 1236–1241, March 2021. (in Japanese) Paper, Best Paper Award
Sakae Mizuki and Takeshi Sakaki. General-Purpose Oriented Extended Named Entity Labeling of Wikipedia Entries. IEICE Technical Report, vol. 117, no. 82, NLC2017-9, pp. 47-52, June 2017. (in Japanese) Paper, Best Paper Award

Articles

Sakae Mizuki. Behind the Scenes of “Semantic Specilization for Knowledge-based Word Sense Disambiguation” (「埋め込み表現の意味適応による知識ベース語義曖昧性解消」ができるまで). Journal of Natural Language Processing, vol. 30, no. 3, pp. 1105-1109, September 2023. (in Japanese) Article

Others

For a full list of publications including research papers written as part of office duties, refer to my Google Scholar

Honors and Awards

Sep. 2024: Best Paper Award (co-authored), the 261st Special Interest Group on Natural Language Processing (SIG-NL)
Mar. 2024: Best Paper Award (co-authored two papers), the 30th Annual Meeting of The Association for Natural Language Processing (NLP2024)
Mar. 2023: Best Paper Award, the 29th Annual Meeting of The Association for Natural Language Processing (NLP2023)
Mar. 2021: Best Paper Award, the 27th Annual Meeting of The Association for Natural Language Processing (NLP2021)
Jan. 2018: Best Paper Award, IEICE Technical Committee on Natural Language Understanding and Models of Communication (NLC)

Education

Apr. 2018 - Dec. 2023: Ph.D. of Engineering (Computer Science), Institute of Science Tokyo (formerly Tokyo Institute of Technology), Japan
- Research field: Natural Language Processing
Apr. 2007 - Mar. 2009: Master of Engineering, Tokyo University, Japan
- Research field: Aerospace Engineering
Apr. 2003 - Mar. 2007: Bachelor of Engineering, Nagoya University, Japan
- Research field: Aerospace Engineering

Work Experience

Jul. 2024 - Present: National Institute of Advanced Industrial Science and Technology
- Visiting Researcher
Jan. 2024 - Present: Institute of Science Tokyo
- Part-time Researcher
Mar. 2015 - Present: Hotto Link Inc.
- Research Engineer
Apr. 2009 - Feb. 2015: Mizuho Bank, Ltd. (seconded to Mizuho-DL Financial Technology Co., Ltd.)
- Financial Engineer

Projects

Oct. 2023 - Present: Swallow LLM: large-scale language model research and development program

Talks

Aug. 2024: Paper Reading Seminar. Instruction-tuned Language Models are Better Knowledge Learners. In: 最先端NLP勉強会 2024. Slide
Jan. 2024: LLM Seminar. 大規模言語モデルSwallow. In: LLM勉強会.
Sep. 2023: LLM Seminar. Model imitationによるInstruction tuningのサーベイ. In: LLM勉強会.
Mar. 2023: Invited talk. 埋め込み表現の意味適応による知識ベース語義曖昧性解消. In: NLP Colloquium.
Mar. 2022: Workshop Lightning Talk. SNSを出典とする言語資源の公開にまつわるノウハウ. In: NLP2022 Co-located Workshop 日本語における評価用データセットの構築と利用性の向上 (JED2022).
Sep. 2021: Paper Reading Seminar. A Distributional Approach to Controlled Text Generation. In: 最先端NLP勉強会 2021. Slide
Sep. 2020: Paper Reading Seminar. ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators. In: 最先端NLP勉強会 2020. Slide
Sep. 2019: Paper Reading Seminar. Ordered neurons: Integrating tree structures into recurrent neural networks. In: 最先端NLP勉強会 2019. Slide
Aug. 2018: Paper Reading Seminar. Sequence-to-Action: End-to-End Semantic Graph Generation for Semantic Parsing. In: 最先端NLP勉強会 2018. Slide
Mar. 2016: Workshop. ソーシャルメディア分析サービスにおけるNLPに関する諸問題について. In: NLP2016 Co-located Workshop 論文に書かない（書けない）自然言語処理.

Membership

The Association for Natural Language Processing (NLP)
Information Processing Society of Japan (IPSJ)
The Securities Analysts Association of Japan (SAAJ)
LLM-jp: LLM勉強会

Skills

Japanese (native)
English (fluent)
Python, PyTorch (, R and C++)
Elasticsearch
Data science