Research Scientist
Adobe Research, San Jose, CA, US



mysmilesh@gmail.com
[CV]  [Google Scholar]  [GitHub]  [LinkedIn]  [twitter]



I am a Research Scientist at Adobe Research. My research interests are in the areas of machine learning and natural language processing (NLP). I am particularly interested in understanding long texts for question answering systems and learning language representation for NLP tasks. Further interests lie in applying and integrating NLP research with other disciplines to tackle practical issues; understanding multimodal information (i.e., text, audio, and visual) and NLP for social good.

I received my Ph.D. in Electrical and Computer Engineering from Seoul National University in 2020 with the Distinguished Dissertation Award, where I was fortunate to be advised by Dr. Kyomin Jung. Prior to Seoul National University, I had involved critical initiatives for the engineering and innovation of AI and machine learning while I was a staff software engineer at Samsung Research Artificial Intelligence Center (2006-2017).


News

  • *new* [05/2023] One paper (Transcript Understanding) is accepted to Interspeech 2023.
  • *new* [05/2023] Two papers (HighGEN, MeetingQA) are accepted to ACL 2023.
  • [01/2023] One paper is accepted to EACL 2023.
  • [10/2022] One paper is accepted to NeurIPS 2022 Workshop on All Things Attention.
  • [10/2022] One paper (GeNER) is accepted to EMNLP 2022.
  • [09/2202] I gave a talk at SKKU, "Pretrained Language Model and Semantic Textual Understanding"
  • [08/2022] Five papers (KGQA, MedicalQA, Offensive content detection, Keyphrase extraction, Acronym extraction) are accepted to COLING 2022.
  • [08/2022] Our patent (language model) has issued.
  • [08/2022] I gave a talk at SNU GoGE Workshop 2022, "Semantic Textual Understanding for Information Retrieval"
  • [04/2022] Two papers (image captioning, multimodal intent discovery) are accepted to NAACL 2022 Findings.
  • [04/2022] Our paper (fake news detection) is accepted to ACL CONSTRAINT 2022.
history

Academic Activities

  • Service:
    Program Committee, NAACL (2019, 2021), ACL (2020, 2021), EMNLP (2019, 2020, 2021, 2022), AACL (2020, 2022), EACL (2021, 2023), COLING (2022)
    Program Committee, AAAI (2020, 2021, 2022, 2023), WWW (2021), INTERSPEECH (2019)
    Journal Reviewer, Information Processing and Management, 2020
    Journal Reviewer, IEEE Signal Processing Letters, 2020

  • Invited Talks:
    Pretrained Language Model and Semantic Textual Understanding, SKKU, Sep. 2022
    Semantic Textual Understanding for Information Retrieval, Seoul National Univ., Aug. 2022
    Mutimodal Evaluation Metric and Image Captioning Model, Korea Univ., Dec. 2021
    Recent Advancements in NLP for QA, LM, and Evaluation Metric, Dongguk Univ., Sep. 2020
    Understanding Long Texts for Question Answering System Using DNN, KAIST/IBS, Jul. 2020
    Question Answering System for Long Text, Adobe Research (San Jose, CA, US), Dec. 2019
    Question Answering System and Multimodal Speech Emotion Recognition, DEEPEST, Aug. 2019
    Research in Natural Language Processing, NVIDIA AI Conference, Jul. 2019
    Question Answering for Short Answer, Adobe Research (San Jose, CA, US), Dec. 2018
    QA-pair ranking algorithm and its applications, NAVER, Aug. 2018
    Learning to Rank Question-Answer Pairs, PyTorch KR, Jun. 2018
    Advancement of the Neural Dialogue Model, Fast campus, Jul. 2018

  • Teaching Assistant:
    Programming Methodology, Seoul National University, Spring 2018
    Machine Learning, Seoul National University, Fall 2015
    Lab. Sentiment Analysis, BigCamp (Big Data Academy), Big Data Institute, 2016-2019


Professional Experiences

  • NLP Research Scientist: Adobe Research (San Jose, CA, US), 2020-present
  • Research Scientist Intern: Adobe Research (San Jose, CA, US), Fall 2018
  • Staff Engineer: Samsung Research (Seoul, KR), 2006-2017
  • Representative of employees: Samsung Electronics (Seoul, KR), 2012-2014
  • Trainer of Global New Employee Course: Samsung Electronics (Seoul, KR), Spring 2011

Publications

    [2023]

  1. Boosting Punctuation Restoration with Data Generation and Reinforcement Learning [pdf]
    Viet Lai, Abel Salinas, Hao Tan, Trung Bui, Quan Hung Tran, Seunghyun Yoon, Hanieh Deilamsalehy, Franck Dernoncourt, Thien Nguyen
    Interspeech 2023

  2. Automatic Creation of Named Entity Recognition Datasets by Querying Phrase Representations [pdf]
    Hhynjae Kim, Jaehyo Yoo, Seunghyun Yoon, Jaewoo Kang
    ACL 2023

  3. MEETINGQA: Extractive Question-Answering on Meeting Transcripts
    Archiki Prasad, Trung Bui, Seunghyun Yoon, Hanieh Deilamsalehy, Franck Dernoncourt, Mohit Bansal
    ACL 2023

  4. PiC: A Phrase-in-Context Dataset for Phrase Understanding and Semantic Search [pdf] [project page]
    Thang M. Pham, Seunghyun Yoon, Trung Bu, Anh Nguyeng
    EACL 2023

  5. [2022]

  6. Factual Error Correction for Abstractive Summaries Using Entity Retrieval [pdf]
    Hwanhee Lee, Cheoneum Park, Seunghyun Yoon, Trung Bu, Franck Dernoncourt, Juae Kim, Kyomin Jung
    EMNLP 2022 Workshop on GEM

  7. Improving cross-modal attention via object detection [pdf]
    Yongil Kim, Yerin Hwang, Seunghyun Yoon, Hyeongu Yun, Kyomin Jung
    NeurIPS 2022 Workshop on All Things Attention

  8. Simple Questions Generate Named Entity Recognition Datasets [pdf] [code]
    Hyunjae Kim, Jaehyo Yoo, Seunghyun Yoon, Jinhyuk Lee, Jaewoo Kang
    EMNLP 2022

  9. Virtual Knowledge Graph Construction for Zero-Shot Domain-Specific Document Retrieval [pdf]
    Yeon Seonwoo, Seunghyun Yoon, Franck Dernoncourt, Trung Bui, Alice Oh
    COLING 2022

  10. Medical Question Understanding and Answering with Knowledge Grounding and Semantic Self-Supervision [pdf]
    Khalil Mrini, Harpreet Singh, Franck Dernoncourt, Seunghyun Yoon, Trung Bui, Walter W. Chang, Emilia Farcas, Ndapa Nakashole
    COLING 2022

  11. Offensive Content Detection Via Synthetic Code-Switched Text [pdf]
    Cesa Salaam, Franck Dernoncourt, Trung Bui, Seunghyun Yoon
    COLING 2022

  12. Keyphrase Prediction from Video Transcripts: New Dataset and Directions [pdf]
    Amir Pouran Ben Veyseh, Quan Hung Tran, Seunghyun Yoon, Varun Manjunatha, Hanieh Deilamsalehy, Rajiv Jain, Trung Bui, Walter W. Chang, Franck Dernoncourt, Thien Huu Nguyen
    COLING 2022

  13. MACRONYM: A Large-Scale Dataset for Multilingual and Multi-Domain Acronym Extraction [pdf]
    Amir Pouran Ben Veyseh, Nicole Meister, Seunghyun Yoon, Rajiv Jain, Franck Dernoncourt, Thien Huu Nguyen
    COLING 2022

  14. Fine-grained Image Captioning with CLIP Reward [pdf] [code] [demo]
    Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, M Bansal
    NAACL Findings 2022

  15. Multimodal Intent Discovery from Livestream Videos [pdf] [code]
    Adyasha Maharana, Quan Tran, Seunghyun Yoon, Franck Dernoncourt, Trung Bui, Walter Chang, M Bansal
    NAACL Findings 2022

  16. How does fake news use a thumbnail? CLIP-based Multimodal Detection on the Unrepresentative News Image [pdf]
    Hyewon Choi, Yejun Yoon, Seunghyun Yoon, Kunwoo Park
    ACL CONSTRAINT 2022

  17. CAISE: Conversational Agent for Image Search and Editing [pdf] [code]
    Hyounghun Kim, Doo Soon Kim, Seunghyun Yoon, Franck Dernoncourt, Trung Bui, Mohit Bansal
    AAAI 2022

  18. [2021]

  19. Few-Shot Intent Detection via Contrastive Pre-Training and Fine-Tuning [pdf]
    J Zhang, T Bui, S Yoon, X Chen, Z Liu, C Xia, QH Tran, W Chang, P Yue
    EMNLP 2021

  20. QACE: Asking Questions to Evaluate an Image Caption [pdf] [code]
    H Lee, T Scialom, S Yoon, F Dernoncourt, K Jung
    EMNLP Findings 2021

  21. A Gradually Soft Multi-Task and Data-Augmented Approach to Medical Question Understanding [pdf]
    K Mrini, F Dernoncourt, S Yoon, T Bui, W Chang, E Farcas, N Nakashole
    ACL 2021

  22. UMIC: An Unreferenced Metric for Image Captioning via Contrastive Learning [pdf] [code]
    H Lee, S Yoon, F Dernoncourt, T Bui, K Jung
    ACL 2021

  23. UCSD-Adobe at MEDIQA 2021: Transfer Learning and Answer Sentence Selection for Medical Summarization [pdf]
    K Mrini, F Dernoncourt, S Yoon, T Bui, W Chang, E Farcas, N Nakashole
    NAACL BioNLP 2021

  24. KPQA: A Metric for Generative Question Answering Using Keyphrase Weights [pdf] [code / checkpoint]
    H Lee, S Yoon, F Dernoncourt, DS Kim, T Bui, J Shin, K Jung
    NAACL 2021

  25. Learning to Detect Incongruence in News Headline and Body Text via a Graph Neural Network [pdf] [code]
    (SCIE, IF=3.745)
    S Yoon*, K Park*, M Lee, T Kim, M Cha, K Jung
    IEEE Access 2021

  26. [2020]

  27. Collaborative Training of GANs in Continuous and Discrete Spaces for Text Generation [pdf]
    (SCIE, IF=3.745)
    Y Kim, S Won, S Yoon, K Jung
    IEEE Access 2020

  28. ViLBERTScore: Evaluating Image Caption Using Vision-and-Language BERT [pdf] [code / checkpoint]
    H Lee, S Yoon, F Dernoncourt, DS Kim, T Bui, K Jung
    EMNLP Eval4NLP 2020

  29. Multimodal Speech Emotion Recognition using Cross Attention with Aligned Audio and Text [pdf]
    Y Lee, S Yoon, K Jung
    INTERSPEECH 2020

  30. Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning [pdf] [code / checkpoint]
    (acceptance rate: 25.2%)
    J Shin, Y Lee, S Yoon, K Jung
    ACL 2020

  31. Propagate-Selector: Detecting Supporting Sentences for Question Answering via Graph Neural Networks [pdf] [code]
    S Yoon, F Dernoncourt, DS Kim, T Bui, K Jung
    LREC 2020

  32. Drug-disease Graph: Predicting Adverse Drug Reaction Signals via Graph Neural Network with Clinical Data [pdf] [slide]
    (oral presentation, accpetance rate: 21%)
    H Kwak, M Lee, S Yoon, J Chang, S Park, K Jung
    PAKDD 2020

  33. DSTC8-AVSD: Multimodal Semantic Transformer Network with Retrieval Style Word Generator [pdf]
    H Lee, S Yoon, F Dernoncourt, DS Kim, T Bui, K Jung
    AAAI 2020 DSTC8

  34. Comparative Studies on Machine Learning for Paralinguistic Signal Compression and Classification [pdf]
    (SCI, IF=2.157)
    S Byun*, S Yoon*, K Jung
    Journal of Supercomputing 2020

  35. Attentive Modality Hopping Mechanism for Speech Emotion Recognition [pdf] [code] [slide]
    (oral presentation)
    S Yoon, S Dey, H Lee, K Jung
    IEEE ICASSP 2020

  36. BaitWatcher: A lightweight web interface for the detection of incongruent news headlines [pdf] [book]
    K Park, T Kim, S Yoon, M Cha, K Jung
    Disinformation, Misinformation, and Fake News in Social Media-Emerging Research Challenges and Opportunities, Springer 2020

  37. [2019]

  38. A Compare-Aggregate Model with Latent Clustering for Answer Selection [pdf] [slide] [poster]
    (oral presentation, accpetance rate: 21.2%)
    S Yoon, F Dernoncourt, DS Kim, T Bui, K Jung
    CIKM 2019

  39. Surf at MEDIQA 2019: Improving Performance of Natural Language Inference in the Clinical Domain by Adopting Pre-trained Language Model [pdf] [poster]
    J Nam, S Yoon, K Jung
    ACL BioNLP 2019

  40. Speech Emotion Recognition Using Multi-hop Attention Mechanism [pdf] [slide]
    (oral presentation)
    S Yoon, S Byun, S Dey, K Jung
    IEEE ICASSP 2019

  41. Neural Networks for Compressing and Classifying Speaker-Independent Paralinguistic Signals [pdf]
    S Byun, S Yoon, K Jung
    IEEE BigComp 2019

  42. Detecting Incongruity Between News Headline and Body Text via a Deep Hierarchical Encoder [pdf] [code] [slide] [poster]
    (oral presentation, accpetance rate: 16.2%)
    S Yoon*, K Park*, J Shin, H Lim, S Won, M Cha, K Jung
    AAAI 2019

  43. [2018 and earlier]

  44. Multimodal Speech Emotion Recognition using Audio and Text [pdf] [code] [poster]
    S Yoon, S Byun, K Jung
    IEEE SLT 2018

  45. Comparative Studies of Detecting Abusive Language on Twitter [pdf] [code]  
    Y Lee*, S Yoon*, K Jung
    EMNLP ALW 2018

  46. Learning to Rank Question-Answer Pairs using Hierarchical Recurrent Encoder with Latent Topic Clustering [pdf] [code] [poster] [video_kor]
    (acceptance rate: 31%)
    S Yoon, J Shin, K Jung
    NAACL 2018

  47. Contextual-CNN: A Novel Architecture Capturing Unified Meaning for Sentence Classification [pdf]
    J Shin, Y Kim, S Yoon, K Jung
    IEEE BigComp 2018

  48. Synonym Discovery with Etymology-based Word Embeddings [pdf]
    S Yoon, P Estrada, K Jung
    IEEE SSCI 2017

  49. Efficient Transfer Learning Schemes for Personalized Language Modeling using Recurrent Neural Network [pdf]
    S Yoon, H Yun, Y Kim, G Park, K Jung
    AAAI 2017 (Workshop)

  50. Automatic Question Answering System for Consumer Product [pdf]
    S Yoon, M Sundar, A Gupta, K Jung
    IntelliSys 2016

  51. Mining the Minds of Customers from Online Chat Logs [pdf]
    (accpetance rate: 21%)
    K Park, J Kim, J Park, M Cha, J Nam, S Yoon, E Rhim
    CIKM 2015

  52. Domain Question Answering System
    S Yoon, E Rhim, D Kim
    KIISE Transactions on Computing Practices 2015

  53. Media clips: Implementation of an intuitive media linker [pdf]
    S Yoon, K Lee, H Shin
    IEEE BMSB 2011


Patents

[ International Patents ]

  1. [issued] Utilizing a graph neural network to identify supporting text phrases and generate digital query responses [link]
    S Yoon, F Dernoncourt, DS Kim, T Bui
    US 11,271,876, Mar. 8, 2022

  2. [issued] Utilizing bi-directional recurrent encoders with multi-hop attention for speech emotion recognition [link]
    T Bui, S Dey, S Yoon
    US 11,205,444, Dec. 21, 2021

  3. [issued] Answer selection using a compare-aggregate model with language model and condensed similarity information from latent clustering [link]
    S Yoon, F Dernoncourt, T Bui, DS Kim, CI Dockhorn, Y Gong
    US 11,113,323, Sep. 7, 2021

  4. [issued] Terminal apparatus, server and method of controlling the same [link]
    Y Kim, O Kwon, S Kim, H Oh, S Yoon, S Cha, J Lee
    US 10,084,850, CN 201410085759, EP20140154718, Sep. 25, 2018

  5. Method and device for analyzing user's emotion [link]
    E Rhim, J Kim, J Nam, S Yoon, K Park, J Park, M Cha
    WO2016182393, May. 13, 2016

  6. [issued] Method of recommending application, mobile terminal using the method, and communication system using the method [link]
    J Nam, M Lee, M Koo, S Yoon
    US 9,247,376, Jan. 26, 2016  

  7. [issuedMethod and apparatus for displaying photo on screen having any shape [link]
    S Yoon, M Lee
    US 9,049,383, Jun. 2, 2015  

  8. [issuedMethod and apparatus for providing information and computer readable storage medium having a program recorded thereon for executing the method [link]
    S Yoon, M Lee, M Koo, J Nam
    US 8,958,824, Feb. 17, 2015  

  9. [issuedApparatus and method for clipping and sharing content at a portable terminal [link]
    S Yoon, M Lee, M Koo, J Nam
    US 13/629,394, CN103827913A, EP20120837007, PCT/KR1020110097578, May. 28, 2014

  10. [issuedMethod and apparatus for fast tracking position by using global positioning system [link]
    S Yoon, S Kim
    US 8,094,070, Jan. 10, 2012  

[ Korean Patents ]

  1. [issuedApparatus and method for evaluating sentense by using bidirectional language model
    K Jung, J Shin, S Yoon
    KR 10-2436900, Aug. 23, 2022

  2. [issuedMethod and apparatus for emotion recognition based on cross attentionmodel
    K Jung, Y Lee, S Yoon
    KR 10-2365433, Feb. 16, 2022

  3. [issuedArtificial intelligence based dialog system and response control method thereof
    K Jung, S Yoon, J Shin, H Kwak, S Byun
    KR 10-2059015, Dec. 18, 2019

  4. [issuedTerminal apparatus, server and method of controlling the same
    Y Kim, O Kwon, S Kim, H Oh, S Yoon, S Cha, J Lee
    KR 10-1832394, Feb. 20, 2018

  5. [issuedApparatus and method for collecting information of destination in portable terminal
    S Yoon, J Nam, M Koo, M Lee
    KR 10-1914632, Oct. 29, 2018

  6. [issuedMethod and apparatus for providing information, and computer readable storage medium
    S Yoon, M Lee, M Koo, J Nam
    KR 10-1773167, Aug. 24, 2017  

  7. [issued] Method for recommendation of application, mobile terminal thereof and communication system thereof
    J Nam, M Lee, M Koo, S Yoon
    KR 10-1747303, Jun. 8, 2017  

  8. Device and method for analyzing user emotion
    E Rhim, J Kim, J Nam, S Yoon, K Park, J Park, M Cha
    KR 1020160058782, May. 13, 2016  

  9. [issued] Method and apparatus for fast positioning using global positioning system
    S Yoon, S Kim
    KR 10-1564938, Oct. 27, 2015