Large language models (LLMs) as agents for augmented democracy

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

We explore an augmented democracy system built on off-the-shelf large language models (LLMs) fine-tuned to augment data on citizens’ preferences elicited over policies extracted from the government programmes of the two main candidates of Brazil’s 2022 presidential election. We use a train-test cross-validation set-up to estimate the accuracy with which the LLMs predict both: a subject’s individual political choices and the aggregate preferences of the full sample of participants. At the individual level, we find that LLMs predict out of sample preferences more accurately than a ‘bundle rule’, which would assume that citizens always vote for the proposals of the candidate aligned with their self-reported political orientation. At the population level, we show that a probabilistic sample augmented by an LLM provides a more accurate estimate of the aggregate preferences of a population than the non-augmented probabilistic sample alone. Together, these results indicate that policy preference data augmented using LLMs can capture nuances that transcend party lines and represents a promising avenue of research for data augmentation.

This article is part of the theme issue ‘Co-creating the future: participatory cities and digital governance’.

Related collections

Most cited references 68

Record: found
Abstract: found
Article: not found

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, Nick Ryder … (2020)

Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few examples or from simple instructions - something which current NLP systems still largely struggle to do. Here we show that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches. Specifically, we train GPT-3, an autoregressive language model with 175 billion parameters, 10x more than any previous non-sparse language model, and test its performance in the few-shot setting. For all tasks, GPT-3 is applied without any gradient updates or fine-tuning, with tasks and few-shot demonstrations specified purely via text interaction with the model. GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic. At the same time, we also identify some datasets where GPT-3's few-shot learning still struggles, as well as some datasets where GPT-3 faces methodological issues related to training on large web corpora. Finally, we find that GPT-3 can generate samples of news articles which human evaluators have difficulty distinguishing from articles written by humans. We discuss broader societal impacts of this finding and of GPT-3 in general. 40+32 pages

0 comments Cited 1023 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Fake news on Twitter during the 2016 U.S. presidential election

Nir Grinberg, Kenneth Joseph, Lisa Friedland … (2019)

The spread of fake news on social media became a public concern in the United States after the 2016 presidential election. We examined exposure to and sharing of fake news by registered voters on Twitter and found that engagement with fake news sources was extremely concentrated. Only 1% of individuals accounted for 80% of fake news source exposures, and 0.1% accounted for nearly 80% of fake news sources shared. Individuals most likely to engage with fake news sources were conservative leaning, older, and highly engaged with political news. A cluster of fake news sources shared overlapping audiences on the extreme right, but for people across the political spectrum, most political news exposure still came from mainstream media outlets.

0 comments Cited 383 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Book: not found

BERT: pre-training of deep bidirectional transformers for language understanding

J Devlin, MW Chang, K Lee … (2019)

0 comments Cited 193 times – based on 0 reviews

Bookmark

All references

Author and article information

Contributors

Jairo F. Gudiño: Role: ConceptualizationRole: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – review and editing

Umberto Grandi: Role: ConceptualizationRole: MethodologyRole: Project administrationRole: SupervisionRole: Writing – review and editing

César Hidalgo:

ORCID: https://orcid.org/0000-0002-6977-9492

Role: ConceptualizationRole: Funding acquisitionRole: InvestigationRole: MethodologyRole: Project administrationRole: SupervisionRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review and editing

Journal

Journal ID (nlm-ta): Philos Trans A Math Phys Eng Sci

Journal ID (iso-abbrev): Philos Trans A Math Phys Eng Sci

Journal ID (publisher-id): RSTA

Journal ID (hwp): roypta

Title: Philosophical transactions. Series A, Mathematical, physical, and engineering sciences

Publisher: The Royal Society

ISSN (Print): 1364-503X

ISSN (Electronic): 1471-2962

Publication date (Electronic, collection): 16 December 2024

Publication date (Electronic, pub): November 13, 2024

Publication date PMC-release: November 13, 2024

Volume: 382

Issue: 2285

Electronic Location Identifier: 20240100

Affiliations

[ ¹ ]Center for Collective Learning, University of Toulouse & Corvinus University of Budapest; , Toulouse, France

[ ² ]IRIT, Université Toulouse Capitole; , Toulouse, France

[ ³ ]Toulouse School of Economics, Université Toulouse Capitole; , Toulouse, France

Author notes

Electronic supplementary material is available online https://doi.org/10.6084/m9.figshare.c.7484141.

^†Present address: Department of Social and Behavioural Sciences, Toulouse School of Economics, 1 Esplanade de l'Université, Toulouse Cedex 06, Toulouse, 31080, France

One contribution of 17 to a theme issue ‘Co-creating the future: participatory cities and digital governance’.

Author information

César Hidalgo https://orcid.org/0000-0002-6977-9492

Article

Publisher ID: rsta20240100

DOI: 10.1098/rsta.2024.0100

PMC ID: 11776576

PubMed ID: 39533908

SO-VID: 5d537c27-13d0-4198-95bb-21780aafe7bf

License:

Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.

History

Date received : May 6, 2024

Date revision received : July 26, 2024

Date accepted : July 30, 2024

Funding

Funded by: European Research Executive Agency, FundRef http://dx.doi.org/10.13039/100020668;

Funded by: HORIZON EUROPE Climate, Energy and Mobility, FundRef http://dx.doi.org/10.13039/100018700;

Funded by: ANR;

Comments

Comment on this article

scite_

Smart Citations

Citing PublicationsSupportingMentioningContrasting

View Citations

See how this article has been cited at scite.ai

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.