Research Scientist
Adobe Research, San Jose, CA, US



mysmilesh@gmail.com
[CV]  [Google Scholar]  [GitHub]  [LinkedIn]  [twitter]



I am a Research Scientist at Adobe Research. My research interests are in machine learning and natural language processing (NLP). I am particularly interested in understanding long texts for question answering systems and in learning language representations for NLP tasks. I am also interested in applying and integrating NLP research with other disciplines to tackle practical issues, in understanding multimodal information (i.e., text, audio, and visual), and in NLP for social good.

I received my Ph.D. in Electrical and Computer Engineering from Seoul National University in 2020 with the Distinguished Dissertation Award, where I was fortunate to be advised by Dr. Kyomin Jung. Prior to Seoul National University, I was involved in critical initiatives for the engineering and innovation of AI and machine learning as a staff software engineer at Samsung Research Artificial Intelligence Center (2006-2017).


News

  • *new* [02/2024] One paper (Video Summary) is accepted to CVPR 2024.
  • *new* [01/2024] One paper (Textual Representation) is accepted to EACL 2024 Findings.
  • *new* [11/2023] One paper (Video Topic Segmentation) is accepted to Multimedia Modelling 2024.
  • [10/2023] One paper (Transcript Understanding) is accepted to IEEE BigData 2023.
  • [10/2023] One paper (Image Captioning Metric) is accepted to EMNLP 2023 Findings.
  • One paper (Moment Detection) is accepted to ICCV 2023.
  • [05/2023] One paper (Transcript Understanding) is accepted to Interspeech 2023.
  • [05/2023] Two papers (HighGEN, MeetingQA) are accepted to ACL 2023.
  • [01/2023] One paper is accepted to EACL 2023.

Academic Activities

  • Service:
    Program Committee, NAACL (since 2019), ACL (since 2020), EMNLP (since 2019), AACL (since 2020), EACL (since 2021), COLING (since 2022), LREC (since 2023), ARR (since 2022)
    Program Committee, AAAI (since 2020), WWW (since 2021), INTERSPEECH (2019), ICLR (since 2023)
    Journal Reviewer, Information Processing and Management, 2020
    Journal Reviewer, IEEE Signal Processing Letters, 2020

  • Invited Talks:
    Pretrained Language Model and Semantic Textual Understanding, SKKU, Sep. 2022
    Semantic Textual Understanding for Information Retrieval, Seoul National Univ., Aug. 2022
    Multimodal Evaluation Metric and Image Captioning Model, Korea Univ., Dec. 2021
    Recent Advancements in NLP for QA, LM, and Evaluation Metric, Dongguk Univ., Sep. 2020
    Understanding Long Texts for Question Answering System Using DNN, KAIST/IBS, Jul. 2020
    Question Answering System for Long Text, Adobe Research (San Jose, CA, US), Dec. 2019
    Question Answering System and Multimodal Speech Emotion Recognition, DEEPEST, Aug. 2019
    Research in Natural Language Processing, NVIDIA AI Conference, Jul. 2019
    Question Answering for Short Answer, Adobe Research (San Jose, CA, US), Dec. 2018
    QA-pair ranking algorithm and its applications, NAVER, Aug. 2018
    Learning to Rank Question-Answer Pairs, PyTorch KR, Jun. 2018
    Advancement of the Neural Dialogue Model, Fast campus, Jul. 2018

  • Teaching Assistant:
    Programming Methodology, Seoul National University, Spring 2018
    Machine Learning, Seoul National University, Fall 2015
    Lab. Sentiment Analysis, BigCamp (Big Data Academy), Big Data Institute, 2016-2019


Professional Experiences

  • NLP Research Scientist: Adobe Research (San Jose, CA, US), 2020-present
  • Research Scientist Intern: Adobe Research (San Jose, CA, US), Fall 2018
  • Staff Engineer: Samsung Research (Seoul, KR), 2006-2017
  • Representative of employees: Samsung Electronics (Seoul, KR), 2012-2014
  • Trainer of Global New Employee Course: Samsung Electronics (Seoul, KR), Spring 2011

Publications

    [arXiv]

  1. Understanding News Thumbnail Representativeness by Counterfactual Text-Guided Contrastive Language-Image Pretraining [pdf]
    Yejun Yoon, Seunghyun Yoon, Kunwoo Park

  2. PDFTriage: Question Answering over Long, Structured Documents [pdf]
    Jon Saad-Falcon, Joe Barrow, Alexa Siu, Ani Nenkova, Seunghyun Yoon, Ryan A. Rossi, Franck Dernoncourt

  3. Multilingual Sentence-Level Semantic Search using Meta-Distillation Learning [pdf]
    Meryem M'hamdi, Jonathan May, Franck Dernoncourt, Trung Bui, Seunghyun Yoon

  4. MVMR: Evaluating Natural Language Video Localization Bias over Multiple Reliable Videos Pool [pdf]
    Nakyeong Yang, Minsung Kim, Seunghyun Yoon, Joongbo Shin, Kyomin Jung

  5. [2024]

  6. Scaling Up Video Summarization Pretraining with Large Language Models
    Dawit Mureja Argaw, Seunghyun Yoon, Fabian Caba Heilbron, Hanieh Deilamsalehy, Trung Bui, Zhaowen Wang, Franck Dernoncourt, Joon Son Chung
    CVPR 2024

  7. Fine-tuning CLIP Text Encoders with Two-step Paraphrasing [pdf]
    Hyunjae Kim, Seunghyun Yoon, Trung Bui, Handong Zhao, Quan Tran, Franck Dernoncourt, Jaewoo Kang
    EACL 2024 Findings

  8. Multi-Modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation [pdf]
    Linzi Xing, Quan Tran, Fabian Caba Heilbron, Franck Dernoncourt, Seunghyun Yoon, Zhaowen Wang, Trung Bui, Giuseppe Carenini
    Multimedia Modelling 2024

  9. [2023]

  10. Aspect-based Meeting Transcript Summarization: A Two-Stage Approach with Weak Supervision on Sentence Classification [pdf]
    Zhongfen Deng, Seunghyun Yoon, Trung Bui, Franck Dernoncourt, Quan Tran, Shuaiqi Liu, Wenting Zhao, Tao Zhang, Yibo Wang, Philip Yu
    IEEE BigData 2023

  11. Perturbation Robust Metric for Multi-Lingual Image Captioning [pdf]
    Yongil Kim, Yerin Hwang, Hyeongu Yun, Seunghyun Yoon, Trung Bui, Kyomin Jung
    EMNLP 2023 Findings

  12. Moment Detection in Long Tutorial Videos [pdf] [code]
    Ioana Croitoru, Simion-Vlad Bogolin, Samuel Albanie, Yang Liu, Zhaowen Wang, Seunghyun Yoon, Franck Dernoncourt, Hailin Jin, Trung Bui
    ICCV 2023

  13. Boosting Punctuation Restoration with Data Generation and Reinforcement Learning [pdf]
    Viet Lai, Abel Salinas, Hao Tan, Trung Bui, Quan Tran, Seunghyun Yoon, Hanieh Deilamsalehy, Franck Dernoncourt, Thien Nguyen
    Interspeech 2023

  14. Automatic Creation of Named Entity Recognition Datasets by Querying Phrase Representations [pdf]
    Hyunjae Kim, Jaehyo Yoo, Seunghyun Yoon, Jaewoo Kang
    ACL 2023

  15. MEETINGQA: Extractive Question-Answering on Meeting Transcripts [pdf]
    Archiki Prasad, Trung Bui, Seunghyun Yoon, Hanieh Deilamsalehy, Franck Dernoncourt, Mohit Bansal
    ACL 2023

  16. PiC: A Phrase-in-Context Dataset for Phrase Understanding and Semantic Search [pdf] [page]
    Thang M. Pham, Seunghyun Yoon, Trung Bui, Anh Nguyen
    EACL 2023

  17. [2022]

  18. Factual Error Correction for Abstractive Summaries Using Entity Retrieval [pdf]
    Hwanhee Lee, Cheoneum Park, Seunghyun Yoon, Trung Bui, Franck Dernoncourt, Juae Kim, Kyomin Jung
    EMNLP 2022 Workshop on GEM

  19. Improving cross-modal attention via object detection [pdf]
    Yongil Kim, Yerin Hwang, Seunghyun Yoon, Hyeongu Yun, Kyomin Jung
    NeurIPS 2022 Workshop on All Things Attention

  20. Simple Questions Generate Named Entity Recognition Datasets [pdf] [code]
    Hyunjae Kim, Jaehyo Yoo, Seunghyun Yoon, Jinhyuk Lee, Jaewoo Kang
    EMNLP 2022

  21. Virtual Knowledge Graph Construction for Zero-Shot Domain-Specific Document Retrieval [pdf]
    Yeon Seonwoo, Seunghyun Yoon, Franck Dernoncourt, Trung Bui, Alice Oh
    COLING 2022

  22. Medical Question Understanding and Answering with Knowledge Grounding and Semantic Self-Supervision [pdf]
    Khalil Mrini, Harpreet Singh, Franck Dernoncourt, Seunghyun Yoon, Trung Bui, Walter W. Chang, Emilia Farcas, Ndapa Nakashole
    COLING 2022

  23. Offensive Content Detection Via Synthetic Code-Switched Text [pdf]
    Cesa Salaam, Franck Dernoncourt, Trung Bui, Seunghyun Yoon
    COLING 2022

  24. Keyphrase Prediction from Video Transcripts: New Dataset and Directions [pdf]
    Amir Pouran Ben Veyseh, Quan Tran, Seunghyun Yoon, Varun Manjunatha, Hanieh Deilamsalehy, Rajiv Jain, Trung Bui, Walter W. Chang, Franck Dernoncourt, Thien Huu Nguyen
    COLING 2022

  25. MACRONYM: A Large-Scale Dataset for Multilingual and Multi-Domain Acronym Extraction [pdf]
    Amir Pouran Ben Veyseh, Nicole Meister, Seunghyun Yoon, Rajiv Jain, Franck Dernoncourt, Thien Huu Nguyen
    COLING 2022

  26. Fine-grained Image Captioning with CLIP Reward [pdf] [code] [demo]
    Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal
    NAACL 2022 Findings

  27. Multimodal Intent Discovery from Livestream Videos [pdf] [code]
    Adyasha Maharana, Quan Tran, Seunghyun Yoon, Franck Dernoncourt, Trung Bui, Walter Chang, Mohit Bansal
    NAACL 2022 Findings

  28. How does fake news use a thumbnail? CLIP-based Multimodal Detection on the Unrepresentative News Image [pdf]
    Hyewon Choi, Yejun Yoon, Seunghyun Yoon, Kunwoo Park
    ACL CONSTRAINT 2022

  29. CAISE: Conversational Agent for Image Search and Editing [pdf] [code]
    Hyounghun Kim, Doo Soon Kim, Seunghyun Yoon, Franck Dernoncourt, Trung Bui, Mohit Bansal
    AAAI 2022

  30. [2021]

  31. Few-Shot Intent Detection via Contrastive Pre-Training and Fine-Tuning [pdf]
    J Zhang, T Bui, S Yoon, X Chen, Z Liu, C Xia, QH Tran, W Chang, P Yu
    EMNLP 2021

  32. QACE: Asking Questions to Evaluate an Image Caption [pdf] [code]
    H Lee, T Scialom, S Yoon, F Dernoncourt, K Jung
    EMNLP 2021 Findings

  33. A Gradually Soft Multi-Task and Data-Augmented Approach to Medical Question Understanding [pdf]
    K Mrini, F Dernoncourt, S Yoon, T Bui, W Chang, E Farcas, N Nakashole
    ACL 2021

  34. UMIC: An Unreferenced Metric for Image Captioning via Contrastive Learning [pdf] [code]
    H Lee, S Yoon, F Dernoncourt, T Bui, K Jung
    ACL 2021

  35. UCSD-Adobe at MEDIQA 2021: Transfer Learning and Answer Sentence Selection for Medical Summarization [pdf]
    K Mrini, F Dernoncourt, S Yoon, T Bui, W Chang, E Farcas, N Nakashole
    NAACL BioNLP 2021

  36. KPQA: A Metric for Generative Question Answering Using Keyphrase Weights [pdf] [code]
    H Lee, S Yoon, F Dernoncourt, DS Kim, T Bui, J Shin, K Jung
    NAACL 2021

  37. Learning to Detect Incongruence in News Headline and Body Text via a Graph Neural Network [pdf] [code]
    (SCI, IF=3.745)
    S Yoon*, K Park*, M Lee, T Kim, M Cha, K Jung
    IEEE Access 2021

  38. [2020]

  39. Collaborative Training of GANs in Continuous and Discrete Spaces for Text Generation [pdf]
    (SCI, IF=3.745)
    Y Kim, S Won, S Yoon, K Jung
    IEEE Access 2020

  40. ViLBERTScore: Evaluating Image Caption Using Vision-and-Language BERT [pdf] [code]
    H Lee, S Yoon, F Dernoncourt, DS Kim, T Bui, K Jung
    EMNLP Eval4NLP 2020

  41. Multimodal Speech Emotion Recognition using Cross Attention with Aligned Audio and Text [pdf]
    Y Lee, S Yoon, K Jung
    INTERSPEECH 2020

  42. Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning [pdf]
    J Shin, Y Lee, S Yoon, K Jung
    ACL 2020

  43. Propagate-Selector: Detecting Supporting Sentences for Question Answering via Graph Neural Networks [pdf] [code]
    S Yoon, F Dernoncourt, DS Kim, T Bui, K Jung
    LREC 2020

  44. Drug-disease Graph: Predicting Adverse Drug Reaction Signals via Graph Neural Network with Clinical Data [pdf] [slide]
    (oral presentation)
    H Kwak, M Lee, S Yoon, J Chang, S Park, K Jung
    PAKDD 2020

  45. DSTC8-AVSD: Multimodal Semantic Transformer Network with Retrieval Style Word Generator [pdf]
    H Lee, S Yoon, F Dernoncourt, DS Kim, T Bui, K Jung
    AAAI 2020 DSTC8

  46. Comparative Studies on Machine Learning for Paralinguistic Signal Compression and Classification [pdf]
    (SCI, IF=2.157)
    S Byun*, S Yoon*, K Jung
    Journal of Supercomputing 2020

  47. Attentive Modality Hopping Mechanism for Speech Emotion Recognition [pdf] [code] [slide]
    (oral presentation)
    S Yoon, S Dey, H Lee, K Jung
    IEEE ICASSP 2020

  48. BaitWatcher: A lightweight web interface for the detection of incongruent news headlines [pdf] [book]
    K Park, T Kim, S Yoon, M Cha, K Jung
    Disinformation, Misinformation, and Fake News in Social Media-Emerging Research Challenges and Opportunities, Springer 2020

  49. [2019]

  50. A Compare-Aggregate Model with Latent Clustering for Answer Selection [pdf] [slide] [poster]
    (oral presentation)
    S Yoon, F Dernoncourt, DS Kim, T Bui, K Jung
    CIKM 2019

  51. Surf at MEDIQA 2019: Improving Performance of Natural Language Inference in the Clinical Domain by Adopting Pre-trained Language Model [pdf]
    J Nam, S Yoon, K Jung
    ACL BioNLP 2019

  52. Speech Emotion Recognition Using Multi-hop Attention Mechanism [pdf] [slide]
    (oral presentation)
    S Yoon, S Byun, S Dey, K Jung
    IEEE ICASSP 2019

  53. Neural Networks for Compressing and Classifying Speaker-Independent Paralinguistic Signals [pdf]
    S Byun, S Yoon, K Jung
    IEEE BigComp 2019

  54. Detecting Incongruity Between News Headline and Body Text via a Deep Hierarchical Encoder [pdf] [code]
    (oral presentation)
    S Yoon*, K Park*, J Shin, H Lim, S Won, M Cha, K Jung
    AAAI 2019

  55. [2018 and earlier]

  56. Multimodal Speech Emotion Recognition using Audio and Text [pdf] [code]
    S Yoon, S Byun, K Jung
    IEEE SLT 2018

  57. Comparative Studies of Detecting Abusive Language on Twitter [pdf] [code]
    Y Lee*, S Yoon*, K Jung
    EMNLP ALW 2018

  58. Learning to Rank Question-Answer Pairs using Hierarchical Recurrent Encoder with Latent Topic Clustering [pdf] [code]
    S Yoon, J Shin, K Jung
    NAACL 2018

  59. Contextual-CNN: A Novel Architecture Capturing Unified Meaning for Sentence Classification [pdf]
    J Shin, Y Kim, S Yoon, K Jung
    IEEE BigComp 2018

  60. Synonym Discovery with Etymology-based Word Embeddings [pdf]
    S Yoon, P Estrada, K Jung
    IEEE SSCI 2017

  61. Efficient Transfer Learning Schemes for Personalized Language Modeling using Recurrent Neural Network [pdf]
    S Yoon, H Yun, Y Kim, G Park, K Jung
    AAAI 2017 (Workshop)

  62. Automatic Question Answering System for Consumer Product [pdf]
    S Yoon, M Sundar, A Gupta, K Jung
    IntelliSys 2016

  63. Mining the Minds of Customers from Online Chat Logs [pdf]
    K Park, J Kim, J Park, M Cha, J Nam, S Yoon, E Rhim
    CIKM 2015

  64. Domain Question Answering System [pdf]
    S Yoon, E Rhim, D Kim
    KIISE Transactions on Computing Practices 2015

  65. Media clips: Implementation of an intuitive media linker [pdf]
    S Yoon, K Lee, H Shin
    IEEE BMSB 2011


Patents

[ International Patents ]

  1. [issued] Utilizing a graph neural network to identify supporting text phrases and generate digital query responses [link]
    S Yoon, F Dernoncourt, DS Kim, T Bui
    US 11,271,876, 8-Mar-2022

  2. [issued] Utilizing bi-directional recurrent encoders with multi-hop attention for speech emotion recognition [link]
    T Bui, S Dey, S Yoon
    US 11,205,444, 21-Dec-2021

  3. [issued] Answer selection using a compare-aggregate model with language model and condensed similarity information from latent clustering [link]
    S Yoon, F Dernoncourt, T Bui, DS Kim, CI Dockhorn, Y Gong
    US 11,113,323, 7-Sep-2021

  4. [issued] Terminal apparatus, server and method of controlling the same [link]
    Y Kim, O Kwon, S Kim, H Oh, S Yoon, S Cha, J Lee
    US 10,084,850, CN 201410085759, EP20140154718, 25-Sep-2018

  5. Method and device for analyzing user's emotion [link]
    E Rhim, J Kim, J Nam, S Yoon, K Park, J Park, M Cha
    WO2016182393, 13-May-2016

  6. [issued] Method of recommending application, mobile terminal using the method, and communication system using the method [link]
    J Nam, M Lee, M Koo, S Yoon
    US 9,247,376, 26-Jan-2016

  7. [issued] Method and apparatus for displaying photo on screen having any shape [link]
    S Yoon, M Lee
    US 9,049,383, 2-Jun-2015

  8. [issued] Method and apparatus for providing information and computer readable storage medium having a program recorded thereon for executing the method [link]
    S Yoon, M Lee, M Koo, J Nam
    US 8,958,824, 17-Feb-2015

  9. [issued] Apparatus and method for clipping and sharing content at a portable terminal [link]
    S Yoon, M Lee, M Koo, J Nam
    US 13/629,394, CN103827913A, EP20120837007, PCT/KR1020110097578, 28-May-2014

  10. [issued] Method and apparatus for fast tracking position by using global positioning system [link]
    S Yoon, S Kim
    US 8,094,070, 10-Jan-2012

[ Korean Patents ]

  1. [issued] Apparatus and method for evaluating sentence by using bidirectional language model
    K Jung, J Shin, S Yoon
    KR 10-2436900, 23-Aug-2022

  2. [issued] Method and apparatus for emotion recognition based on cross attention model
    K Jung, Y Lee, S Yoon
    KR 10-2365433, 16-Feb-2022

  3. [issued] Artificial intelligence based dialog system and response control method thereof
    K Jung, S Yoon, J Shin, H Kwak, S Byun
    KR 10-2059015, 18-Dec-2019

  4. [issued] Terminal apparatus, server and method of controlling the same
    Y Kim, O Kwon, S Kim, H Oh, S Yoon, S Cha, J Lee
    KR 10-1832394, 20-Feb-2018

  5. [issued] Apparatus and method for collecting information of destination in portable terminal
    S Yoon, J Nam, M Koo, M Lee
    KR 10-1914632, 29-Oct-2018

  6. [issued] Method and apparatus for providing information, and computer readable storage medium
    S Yoon, M Lee, M Koo, J Nam
    KR 10-1773167, 24-Aug-2017

  7. [issued] Method for recommendation of application, mobile terminal thereof and communication system thereof
    J Nam, M Lee, M Koo, S Yoon
    KR 10-1747303, 8-Jun-2017

  8. Device and method for analyzing user emotion
    E Rhim, J Kim, J Nam, S Yoon, K Park, J Park, M Cha
    KR 1020160058782, 13-May-2016

  9. [issued] Method and apparatus for fast positioning using global positioning system
    S Yoon, S Kim
    KR 10-1564938, 27-Oct-2015