0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Large language models (LLMs) as agents for augmented democracy

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We explore an augmented democracy system built on off-the-shelf large language models (LLMs) fine-tuned to augment data on citizens’ preferences elicited over policies extracted from the government programmes of the two main candidates of Brazil’s 2022 presidential election. We use a train-test cross-validation set-up to estimate the accuracy with which the LLMs predict both: a subject’s individual political choices and the aggregate preferences of the full sample of participants. At the individual level, we find that LLMs predict out of sample preferences more accurately than a ‘bundle rule’, which would assume that citizens always vote for the proposals of the candidate aligned with their self-reported political orientation. At the population level, we show that a probabilistic sample augmented by an LLM provides a more accurate estimate of the aggregate preferences of a population than the non-augmented probabilistic sample alone. Together, these results indicate that policy preference data augmented using LLMs can capture nuances that transcend party lines and represents a promising avenue of research for data augmentation.

          This article is part of the theme issue ‘Co-creating the future: participatory cities and digital governance’.

          Related collections

          Most cited references68

          • Record: found
          • Abstract: found
          • Article: not found

          Language Models are Few-Shot Learners

          Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few examples or from simple instructions - something which current NLP systems still largely struggle to do. Here we show that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches. Specifically, we train GPT-3, an autoregressive language model with 175 billion parameters, 10x more than any previous non-sparse language model, and test its performance in the few-shot setting. For all tasks, GPT-3 is applied without any gradient updates or fine-tuning, with tasks and few-shot demonstrations specified purely via text interaction with the model. GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic. At the same time, we also identify some datasets where GPT-3's few-shot learning still struggles, as well as some datasets where GPT-3 faces methodological issues related to training on large web corpora. Finally, we find that GPT-3 can generate samples of news articles which human evaluators have difficulty distinguishing from articles written by humans. We discuss broader societal impacts of this finding and of GPT-3 in general. 40+32 pages
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Fake news on Twitter during the 2016 U.S. presidential election

            The spread of fake news on social media became a public concern in the United States after the 2016 presidential election. We examined exposure to and sharing of fake news by registered voters on Twitter and found that engagement with fake news sources was extremely concentrated. Only 1% of individuals accounted for 80% of fake news source exposures, and 0.1% accounted for nearly 80% of fake news sources shared. Individuals most likely to engage with fake news sources were conservative leaning, older, and highly engaged with political news. A cluster of fake news sources shared overlapping audiences on the extreme right, but for people across the political spectrum, most political news exposure still came from mainstream media outlets.
              Bookmark
              • Record: found
              • Abstract: not found
              • Book: not found

              BERT: pre-training of deep bidirectional transformers for language understanding

                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – review and editing
                Role: ConceptualizationRole: MethodologyRole: Project administrationRole: SupervisionRole: Writing – review and editing
                Role: ConceptualizationRole: Funding acquisitionRole: InvestigationRole: MethodologyRole: Project administrationRole: SupervisionRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review and editing
                Journal
                Philos Trans A Math Phys Eng Sci
                Philos Trans A Math Phys Eng Sci
                RSTA
                roypta
                Philosophical transactions. Series A, Mathematical, physical, and engineering sciences
                The Royal Society
                1364-503X
                1471-2962
                16 December 2024
                November 13, 2024
                November 13, 2024
                : 382
                : 2285
                : 20240100
                Affiliations
                [ 1 ]Center for Collective Learning, University of Toulouse & Corvinus University of Budapest; , Toulouse, France
                [ 2 ]IRIT, Université Toulouse Capitole; , Toulouse, France
                [ 3 ]Toulouse School of Economics, Université Toulouse Capitole; , Toulouse, France
                Author notes

                Electronic supplementary material is available online https://doi.org/10.6084/m9.figshare.c.7484141.

                Present address: Department of Social and Behavioural Sciences, Toulouse School of Economics, 1 Esplanade de l'Université, Toulouse Cedex 06, Toulouse, 31080, France

                One contribution of 17 to a theme issue ‘Co-creating the future: participatory cities and digital governance’.

                Author information
                https://orcid.org/0000-0002-6977-9492
                Article
                rsta20240100
                10.1098/rsta.2024.0100
                11776576
                39533908
                5d537c27-13d0-4198-95bb-21780aafe7bf
                © 2024 The Author(s).

                Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.

                History
                : May 6, 2024
                : July 26, 2024
                : July 30, 2024
                Funding
                Funded by: European Research Executive Agency, FundRef http://dx.doi.org/10.13039/100020668;
                Funded by: HORIZON EUROPE Climate, Energy and Mobility, FundRef http://dx.doi.org/10.13039/100018700;
                Funded by: ANR;
                Categories
                1003
                1009
                1003
                104
                194
                7
                Research Articles
                Research Articles

                digital democracy,algorithmic democracy,artificial intelligence,digital twins,direct democracy,natural language processing

                Comments

                Comment on this article