
Whole-message AI communication seen as more useful

As large language models (LLMs) such as GPT-4 are further developed, they will naturally become better at using available information to generate useful text on virtually any topic, not only by the phrase or sentence but by the whole document.

Employing AI to write entire messages in representative government, an arena where personal correspondence is both crucial and nearly impossible, appears to be more effective than using AI to generate individual sentences, according to new Cornell research.

A research group led by Kreps, the John L. Wetherill Professor in the Department of Government in the College of Arts and Sciences (A&S) and director of the in the Cornell Jeb E. Brooks School of Public Policy, tested an AI-mediated communication program to see whether message-level suggested text was more useful than sentence-level suggestions.

Kreps and her team found that study participants, acting in the role of congressional staffers, who received message-level suggestions responded faster and were more satisfied with the experience than those who got individual sentence suggestions.

“It’s almost a cost-benefit-utility calculation,” said Kreps, noting that elected officials can receive thousands of emails per week, sometimes per day. “Once you’re using this tool, if the message-level suggestion is good enough, which it seemed to be, then it makes sense to use the message level rather than the sentence level, where a lot more human interfacing is required.”

Kreps’ paper, “Comparing Sentence-Level Suggestions to Message-Level Suggestions in AI-Mediated Communication,” is being published in Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI ’23). The lead author is Liye Fu, Ph.D. ’22, an applied research scientist at information technology conglomerate Thomson Reuters.

Co-authors Benjamin Newman, a researcher at the Allen Institute for AI in Seattle, and Maurice Jakesch, Ph.D. ’22, will present the paper at CHI ’23, scheduled for April 23–28 in Hamburg, Germany.

Kreps, also an adjunct professor of law, said she got the idea for this work during previous research on whether lawmakers could be susceptible to AI-generated messages. One member of Congress told her that it wouldn’t be long before “we’re using AI to respond to AI-written messages,” Kreps said. “And he said, ‘That would be really great, because we get a lot of emails, and a lot of them are repetitive, so these tools could be really valuable.’”

Lawmakers already outsource “99.999%” of their email correspondence to staffers, Kreps said, so perhaps AI could handle the job. “Staffers are largely just doing cutting and pasting anyway,” she said. “So these AI tools are not actually demonstrably different from what staffers are doing now.”

For this work, Fu and a group of undergraduate computer science students from the Cornell Bowers College of Computing and Information Science developed Dispatch, an application that could simulate the process of a staffer responding to constituents’ emails. Kreps recruited 120 participants to act as legislative staffers and put them in one of three experiment conditions: 40 participants received no AI-generated assistance; 40 received sentence-level suggestions; and 40 received message-level suggestions, with both types of suggestions generated by GPT-3.

The researchers sampled letters received by legislators through Resistbot, a service that advertises the ability to compose and send letters to legislators in less than two minutes. The researchers used just the contents of the letters, with no names, and chose letters that were sent by multiple people so individual senders couldn’t be identified.

“Staffers” using no AI help needed nearly 16½ minutes to complete each correspondence, nearly twice as long as those using message-level AI suggestions. Those using sentence-level suggestions took just under 16 minutes, due to the need for editing and message-crafting; the actual writing time was around 12 minutes.

Additionally, those who used the message-level response suggestions generally agreed that the system was easy to use and that the suggestions they received were natural and useful. Participants using sentence-level suggestions, however, did not rate the naturalness and usefulness of the suggestions as favorably.

“This is a relationship that should have a high degree of empathy and understanding,” Kreps said of the legislator-constituent dynamic. “Citizens want to feel heard. The problem with that instinct, though, is how far we’ve come from a world where politicians were knocking on doors and having individual conversations and fireside chats. So much of this relationship is already automated.

“If we can be pragmatic and realistic about where automation has already taken this relationship,” she said, “then it can be easier to go the next step and think about how that actually might help individuals connect with their elected leaders.”

This research was funded by a , which encourages A&S faculty to engage in high-impact, boundary-pushing research with potential to secure external support.
