- Language Models over Canonical Byte-Pair Encodings
ICML 2025
Tim Vieira, Tianyu Liu, Clemente Pasti, Yahya Emara, Brian DuSell, Benjamin LeBrun, Mario Giulianelli, Juan Luis Gastaldi, Timothy J. O’Donnell, Ryan Cotterell
- Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo
ICLR 2025, Oral presentation
João Loula, Benjamin LeBrun, Li Du, Ben Lipkin, Clemente Pasti, Gabriel Grand, Tianyu Liu, Yahya Emara, Marjorie Freedman, Jason Eisner, Ryan Cotterell, Vikash Mansinghka, Alexander K. Lew, Tim Vieira, Timothy J. O’Donnell
- Pointwise mutual information as a performance gauge for retrieval-augmented generation
NAACL 2025
Tianyu Liu*, Jirui Qi*, Paul He, Arianna Bisazza, Mrinmaya Sachan, Ryan Cotterell
- Efficiently Computing Susceptibility to Context in Language Models
EMNLP Findings 2024
Tianyu Liu, Kevin Du, Mrinmaya Sachan, Ryan Cotterell
- A Probability–Quality Trade-off in Aligned Language Models and its Relation to Sampling Adaptors
EMNLP 2024
Naaman Tan, Josef Valvoda, Tianyu Liu, Anej Svete, Yanxia Qin, Kan Min-Yen, Ryan Cotterell
- Linear-time modeling of linguistic structure: An order-theoretic perspective
EMNLP 2023, Outstanding Paper Award
Tianyu Liu, Afra Amini, Mrinmaya Sachan, Ryan Cotterell
- Formal aspects of language modeling
Preprint
Ryan Cotterell, Anej Svete, Clara Meister, Tianyu Liu, Li Du
- A geometric notion of causal probing
Preprint
Clément Guerner, Tianyu Liu, Anej Svete, Alexander Warstadt, Ryan Cotterell
- Hexatagging: Projective dependency parsing as tagging
ACL 2023, Outstanding Paper Award
Afra Amini*, Tianyu Liu*, Ryan Cotterell
- Discourse-centric evaluation of document-level machine translation with a new densely annotated parallel corpus of novels
ACL 2023
Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Mrinmaya Sachan, Ryan Cotterell
- A Bilingual Parallel Corpus with Discourse Annotations
Preprint
Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Mrinmaya Sachan, Ryan Cotterell
- Autoregressive structured prediction with language models
EMNLP Findings 2022
Tianyu Liu, Yuchen Jiang, Nicholas Monath, Ryan Cotterell, Mrinmaya Sachan
- BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation
NAACL 2022
Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Jian Yang, Haoyang Huang, Rico Sennrich, Ryan Cotterell, Mrinmaya Sachan, Ming Zhou
- A Structured Span Selector
NAACL 2022
Tianyu Liu, Yuchen Eleanor Jiang, Ryan Cotterell, Mrinmaya Sachan
- Learning to Explain Ambiguous Headlines of Online News
IJCAI 2018
Tianyu Liu, Wei Wei, Xiaojun Wan