• Conference Paper
  • 2025

    SeqAR: Jailbreak LLMs with Sequential Auto-Generated Characters
    Yan Yang, Xin Lu, Hongru WANG, et al., Guanhua Chen, Yun Chen
    NAACL 2025
    Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
    YuZhao, et al., Hongru WANG, Xuanli He, Kam-Fai Wong, Pasquale Minervini
    NAACL 2025 (tool conflicts with each other, how to detect and control the behaviors)
    Self-DC: When to retrieve and When to generate? Self Divide-and-Conquer for ..
    Hongru Wang, Boyang Xue, Baohang Zhou, et. al, Kam-fai Wong
    NAACL 2025 (tool management regarding internal tools and external tools)

    2024

    2023

    2022

  • Journal
  • KddRES: A Multi-level Knowledge-driven Dialogue Dataset for Restaurant Towards Customized Dialogue System
    Hongru WANG, Wai-Chung Kwan, Min Li, Zimo Zhou, Kam-Fai Wong
    Computer Speech and Language (SCI Q2, IF: 4.3)
    A Survey on Recent Advances and Challenges in Reinforcement Learning Methods ..
    Wai-Chung Kwan*, Hongru WANG*, Huimin Wang, Kam-Fai Wong
    Machine Intelligence Research

  • Workshop, Tutorial and Others
  • Analysing the Residual Stream of Language Models Under Knowledge Conflicts
    Yu Zhao, Xiaotang Du, et. al, Hongru Wang, Xuanli He, Kam-Fai Wong, Pasquale Minervini
    MINT Workshop, NeurIPS 2024
    Fine-tuning after Prompting: an Explainable Way for Classification
    Zezhong Wang, Luyao Ye, Hongru WANG, Boyang Xue, Yiming Du, Bin Liang, and Kam-Fai Wong
    SIGHAN, ACL 2024
    PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Fusion ..
    Yiming Du, Hongru WANG, Zhengyi Zhao, et. al, and Kam-Fai Wong
    SIGHAN, ACL 2024 ( Best Paper Award) [Certification]
    OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst
    Jingtao Cao, Zheng Zhang, Hongru WANG, Bin Liang, Hao Wang, Kam-fai Wong
    Online Safety Prize Challenge, WWW 2024 ( Champion)
    Empowering Large Language Models: Tool Learning for Real-World Interaction
    Hongru WANG, Yujia Qin, Yankai Lin, Jeff Z. Pan and Kam-Fai Wong
    Tutorial, SIGIR 2024 (first comprehensive tutorial about tool learning)