MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Kim, Bumsoo, Mun, Jonghwan, On, Kyoung-Woon, Shin, Minchul, Lee, Junhyun, and Kim, Eun-Sol
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2022
Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Ko, Dohwan, Choi, Joonmyung, Ko, Juyeon, Noh, Shinyeong, On, Kyoung-Woon, Kim, Eun-Sol, and Kim, Hyunwoo J
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2022
Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
Heo, Yu-Jung, Kim, Eun-Sol, Choi, Woosuk, and Zhang, Byoung-Tak
In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 2022
Semantic Alignment with Calibrated Similarity for Multilingual Sentence Embedding
Ham, Jiyeon, and Kim, Eun-Sol
In Findings of the Association for Computational Linguistics: EMNLP 2021
HOTR: End-to-End Human-Object Interaction Detection with Transformers
Kim, Bumsoo, Lee, Junhyun, Kang, Jaewoo, Kim, Eun-Sol, and Kim, Hyunwoo J
In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021 (Oral Presentation)