A word's meaning resides in the heart and soul of its "generator": people. How do we ethically incorporate human (personal, social, cultural, situational) context into LLMs, the base models of our NLP systems?

Overview

Language modeling in the context of its source (the author) and its target (the audience) can enable NLP systems to better understand human language. Advances in human-centered NLP have established the importance of modeling the human context holistically, including personal, social, cultural, and situational factors. Yet our NLP systems have become heavily reliant on large language models that do not capture this human context.

Human language depends heavily on its rich and complex human context: (a) who is speaking, (b) to whom, (c) where (the situation or environment), and (d) when. It is further moderated by changing human states of being, such as mental and emotional states.

Given their scale of parameters and pre-training data, current large language models may implicitly simulate some form of human context. However, they do not explicitly process language within its higher-order structure: connecting documents to people, the "source" of the language.

Prior work has demonstrated the benefits of incorporating author information into LLMs for downstream NLP tasks. Recent research has also shown that LLMs can benefit from including additional author context in the LM pre-training task itself. Merging these two successful parallel threads, human-centered NLP and LLMs, drives us toward a vision of human-centered LLMs for the future of NLP.
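To make this direction concrete, the following is a minimal, hypothetical PyTorch sketch of one way author context could enter language modeling: a learned author embedding is prepended to the token sequence as a single soft-prompt position before standard next-token prediction. The architecture, names, and sizes here are illustrative assumptions for exposition only, not a method proposed by the workshop or any particular paper.

    import torch
    import torch.nn as nn

    class AuthorConditionedLM(nn.Module):
        """Toy causal LM that prepends a learned author embedding as a soft prompt.

        The author-ID lookup and all sizes are illustrative assumptions."""

        def __init__(self, vocab_size=1000, num_authors=100, d_model=64):
            super().__init__()
            self.tok_emb = nn.Embedding(vocab_size, d_model)
            self.author_emb = nn.Embedding(num_authors, d_model)  # one vector per author
            layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
            self.encoder = nn.TransformerEncoder(layer, num_layers=2)
            self.lm_head = nn.Linear(d_model, vocab_size)

        def forward(self, token_ids, author_ids):
            x = self.tok_emb(token_ids)                   # (batch, seq, d_model)
            a = self.author_emb(author_ids).unsqueeze(1)  # (batch, 1, d_model) soft prompt
            h = torch.cat([a, x], dim=1)                  # author slot + token sequence
            n = h.size(1)
            causal = torch.triu(torch.full((n, n), float("-inf")), diagonal=1)
            h = self.encoder(h, mask=causal)
            # Logits over the token positions; train with the usual shifted
            # next-token cross-entropy loss.
            return self.lm_head(h[:, 1:, :])

    model = AuthorConditionedLM()
    tokens = torch.randint(0, 1000, (2, 16))  # two documents, 16 tokens each
    authors = torch.tensor([3, 7])            # hypothetical author IDs
    logits = model(tokens, authors)           # (2, 16, 1000)

In practice, the author vector could instead be derived from a user's language history (e.g., averaged document embeddings) rather than an ID lookup; the soft-prompt framing is just one simple way to connect documents to their "source."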


Call for Papers

Human-centered large language modeling has the potential to bring promising improvements to human-centric applications across domains such as healthcare, education, and consumer services. At the same time, this new research focus raises a multitude of unexplored architectural, data, technical, fairness, and ethical challenges.

We invite submissions on topics that include, but are not limited to:

  • Human-centered LLM training/fine-tuning: Strategies to include the human context of the speaker and/or addressee, such as their personal factors, social context, etc.; Integrating group and/or individual human characteristics and traits; Human language modeling with multi-lingual LLMs or low-resource languages
  • Analysis and Applications: Evaluations for human language modeling that demonstrate personalized or socially contextual language understanding; Empirical findings with human language modeling, including failure cases with thorough analysis of negative results; Bias measurement and mitigation using human language modeling; Applications built on top of LLMs for real-world use or translational impact
  • Datasets: Obtaining data for training and evaluating human-contextualized LLMs
  • Position papers: Position papers on opportunities and challenges, including ethical risks

With our workshop, we aim to create a platform where researchers can present emerging challenges and solutions in building human-centered NLP models, integrating adaptation to human and social factors into the base LLMs of our NLP systems.


Archival Submissions

Authors are invited to submit long (8 pages) or short (4 pages) papers, with unlimited pages for references and appendices. Following ACL conference policy, authors of accepted papers will be given one additional page for the final, camera-ready version of their papers.

Please ensure that the submissions are formatted according to the ACL template style. You can access the template here.

Non-Archival Submissions

We welcome non-archival submissions through two tracks.

  • (1) Extended Abstracts: You can submit an extended abstract of work not published elsewhere, of 2-4 pages plus up to 2 pages for references. This can include position papers or early-stage work that would benefit from peer feedback. These submissions will be peer-reviewed double-blind, like the archival papers. Please use the OpenReview submission links below to submit a non-archival extended abstract.
  • (2) Published Papers: Work previously published, or accepted for publication elsewhere (e.g., ACL Findings), can also be submitted to the non-archival track, along with details of the venue or journal where it was accepted and a link to the archived version, if available. These papers will be reviewed single-blind, only for fit to the workshop theme, and have no page limit. We will release a form for submission of non-archival published papers soon.

Accepted papers in both non-archival tracks will be given an opportunity to present at the workshop, but will not be published in the ACL Anthology.


Important Dates

  • May 10 (Fri), 2024: Direct paper submission deadline (archival and non-archival extended abstract)
  • May 17 (Fri), 2024: ARR commitment deadline (Submission of already ARR-reviewed papers with the paper link)
  • June 17 (Mon), 2024: Notification of acceptance
  • June 25 (Tue), 2024: Non-Archival (published papers) submission deadline
  • July 1 (Mon), 2024: Camera-ready paper due
  • August 15 (Thu), 2024: Workshop date

All deadlines are 11:59 pm UTC-12 ("Anywhere on Earth").


Submission Links

Note: All authors must have an OpenReview profile. Please ensure profiles are complete before submission. As per OpenReview's moderation policy for newly created profiles:

  • New profiles created without an institutional email will go through a moderation process that can take up to two weeks.
  • New profiles created with an institutional email will be activated automatically.

If you have any questions, please contact us at: workshophucllm@googlegroups.com



Keynote Speakers

Daniel Hershcovich
University of Copenhagen, Denmark
Snigdha Chaturvedi
University of North Carolina at Chapel Hill, USA
Sebastian Ruder
Google, Germany

Panelists

Carolyn Rosé
Carnegie Mellon University, USA
Kayden Jordan
Harrisburg University of Science and Technology, USA
Debora Nozza
Bocconi University, Italy
Diyi Yang
Stanford University, USA
Sebastian Ruder
Google, Germany
Sara Hooker
Cohere AI, USA

Tentative Schedule

Time           Session
9:00 - 10:00   Keynote: Prof. Daniel Hershcovich (University of Copenhagen), "Cross-cultural Alignments in LLMs"
10:00 - 11:00  Poster session 1
11:00 - 12:00  Paper presentations 1
12:00 - 13:00  Lunch break
13:00 - 14:00  Keynote: Dr. Sebastian Ruder (Google), "Building Multilingual LLMs for User-centric Applications"
14:00 - 15:00  Paper presentations 2
15:30 - 16:30  Keynote: Prof. Snigdha Chaturvedi (UNC Chapel Hill), "Socially-aware NLP"
16:30 - 17:30  Panel discussion: Where are human-centered LLMs important? How can we achieve the vision of human-centered LLMs? Which ethical issues should we keep in mind when creating human-centered LLMs?

Organizers

Nikita Soni
Stony Brook University, USA
Lucie Flek
University of Bonn, Germany
Ashish Sharma
University of Washington, USA
Diyi Yang
Stanford University, USA
Sara Hooker
Cohere AI, USA
H Andrew Schwartz
Stony Brook University, USA

If you have any questions, please contact us at: workshophucllm@googlegroups.com


Program Committee

  • Amanda Curry, Bocconi University, Italy
  • Barbara Plank, LMU Munich, Germany
  • Cesa Salaam, Howard University, USA
  • Chia-Chien Hung, NEC Labs Europe, Germany
  • Dan Goldwasser, Purdue University, USA
  • Daniel Preotiuc-Pietro, Bloomberg, USA
  • Debora Nozza, Bocconi University, Italy
  • Francesco Barbieri, Snap Research, USA
  • Gavin Abercrombie, Heriot-Watt University, Scotland
  • Giuseppe Attanasio, Bocconi University, Italy
  • Harmanpreet Kaur, University of Michigan, USA
  • Hye Sun Yun, Northeastern University, USA
  • Ian Stewart, Pacific Northwest National Laboratory, USA
  • Inna Lin, University of Washington, USA
  • Jaemin Cho, University of North Carolina Chapel Hill, USA
  • Jielin Qiu, Carnegie Mellon University, USA
  • Joan Plepi, University of Marburg, Germany
  • Kokil Jaidka, National University of Singapore
  • Lucy Li, University of California, Berkeley, USA
  • Lucy Lu Wang, University of Washington, USA
  • Lyle Ungar, University of Pennsylvania, USA
  • Maarten Sap, Carnegie Mellon University, USA
  • Maria Antoniak, Allen Institute for AI, USA
  • Matthias Orlikowski, Bielefeld University, Germany
  • Meryem M'hamdi, University of Southern California, USA
  • Monica Munnangi, Northeastern University, USA
  • Salvatore Giorgi, University of Pennsylvania, USA
  • Sherry Tongshuang Wu, Carnegie Mellon University, USA
  • Shijia Liu, Northeastern University, USA
  • Shiran Dudy, Northeastern University, USA
  • Shreya Havaldar, University of Pennsylvania, USA
  • Silvio Amir, Northeastern University, USA
  • Tal August, Allen Institute for AI, USA
  • Vivek Kulkarni, University of California, Santa Barbara, USA
  • Zeerak Talat, Independent Researcher