Background
Pursuing one’s dreams is rarely smooth, but I am deeply grateful to those who have supported me along the way. With your support and my continued effort, I believe my dream can become a reality.
I currently have no connection with any professor at UPenn. Please do not share any information about me with them without my explicit consent. This behavior has had a serious and ongoing adverse impact on my professional opportunities, and I have shared my experience in the document. If this behavior continues, I may identify the individual involved.
This statement is intended to protect myself and is not directed against any individual. I appreciate your understanding and respect for my request.
I have extensive experience in language model reasoning. My work includes knowledge graph–based retrieval-augmented generation systems, language model continual pretraining, neural-symbolic reasoning, event schema induction, and synthetic data augmentation. I have collaborated with many excellent professors in the U.S. and Europe, and I am very grateful for their recognition and support.
To learn more about my research experience, please see my CV, SOP, and recommendation letters.
My research connects human cognition theories with NLP through:
- Cognitive-Inspired Loop Pretraining:
In Continual Pretraining [1], I demonstrate that modeling the human NL → KG → NL learning process as a looped pretraining task improves performance on downstream knowledge-intensive tasks such as summarization, QA, and NLI.
- Zone of Proximal Development (ZPD):
In Proc2PDDL [2], I show that instructing LLMs to incrementally build the required skills, in line with Vygotsky's ZPD, effectively supports complex text-to-code translation tasks.
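To give a flavor of the loop idea, here is a minimal sketch of how one NL → KG → NL pretraining sequence might be assembled. This is an illustrative toy, not the actual pipeline from [1]; the special tokens (`[NL2KG]`, `<H>`, `<R>`, `<T>`, etc.) and the helper functions are hypothetical choices.

```python
# Illustrative sketch (assumed format, not the pipeline from [1]):
# chaining a sentence into its KG triples and back, so a model sees
# both directions of the NL -> KG -> NL loop in one training sequence.

def linearize_triples(triples):
    """Serialize (head, relation, tail) triples into a flat KG string."""
    return " ".join(f"<H> {h} <R> {r} <T> {t}" for h, r, t in triples)

def build_loop_example(sentence, triples):
    """Build one looped pretraining sequence: NL -> KG, then KG -> NL."""
    kg = linearize_triples(triples)
    return f"[NL2KG] {sentence} [KG] {kg} [KG2NL] {kg} [NL] {sentence}"

example = build_loop_example(
    "Marie Curie won the Nobel Prize in Physics.",
    [("Marie Curie", "award", "Nobel Prize in Physics")],
)
```

Training on such round-trip sequences is one way to make the model internalize the mapping between free text and structured knowledge in both directions.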
My research interests include:
- Reasoning in Natural/Symbolic Language
- Language Model Pretraining
- Interdisciplinary research across NLP, CV, and Robotics
Publications
- Effective Domain Adaptation of Instruction-Tuned LLMs for Knowledge-Intensive Tasks. In submission, 2025
  Zhang, T.*, Mai, F.*, Flek, L.
  paper
- PROC2PDDL: Open-Domain Planning Representations from Texts. NLRSE@ACL 2024
  Zhang, T.*, Zhang, L.*, Hou, Z., Wang, Z., Gu, Y., Clark, P., Callison-Burch, C., and Tandon, N.
  paper poster oral
- PDDLEGO: Iterative Planning in Textual Environments. *SEM 2024
  Zhang, L., Jansen, P., Zhang, T., Clark, P., Callison-Burch, C., Tandon, N.
  paper oral
- WorldWeaver: Procedural World Generation for Text Adventure Games. Wordplay@ACL 2024
  Jin, M., Kaul, M., Ramakrishnan, S., Jain, H., Chandrawat, S., Agarwal, I., Zhang, T., Zhu, A., Callison-Burch, C.
  paper
- Human-in-the-Loop Schema Induction. ACL Demo 2023
  Zhang, T.*, Tham, I.*, Hou, Z.*, Ren, J., Zhou, L., Xu, H., Zhang, L., Martin, L., Dror, R., Li, S., Ji, H., Palmer, M., Brown, S., Suchocki, R., and Callison-Burch, C.
  paper poster oral
- Question-Answering Data Augmentation for Argument Role Labeling. 2022
  paper
Education
- MSE in Data Science, Jan. 2021 - Dec. 2022
  University of Pennsylvania, Philadelphia, USA
- M.Ed. in Learning Science and Technology, Sept. 2018 - Dec. 2019
  University of Pennsylvania, Philadelphia, USA
- B.S. in Educational Technology, Sept. 2014 - Jun. 2018
  Beijing Normal University, Beijing, China
