Master's in Informatics (Mestrado em Informática)

Permanent URI for this collection

http://repositorio.ufes.br/handle/10/17627

Level: Academic Master's


Analysis of bias in GPT language models through fine-tuning with anti-vaccination speech
(Universidade Federal do Espírito Santo, 2024-12-02) Turi, Leandro Furlam; Badue, Claudine; Souza, Alberto Ferreira de; https://orcid.org/0000-0003-1561-8447; Pacheco, Andre Georghton Cardoso; Almeida Junior, Jurandy Gomes de
We examined the effects of training a GPT-2 language model on data containing divergent information, specifically anti-vaccination narratives, by fine-tuning it on content from anti-vaccination groups and channels on Telegram. Our objective was to analyze the model’s ability to generate coherent and rationalized texts compared to a model pre-trained on OpenAI’s WebText dataset. The results demonstrate that fine-tuning a GPT-2 model with biased data leads the model to perpetuate these biases in its responses, albeit with a certain degree of rationalization. This highlights the importance of using reliable, high-quality data when training natural language processing models and underscores the implications for information dissemination through these models. We also explored the impact of data poisoning by combining anti-vaccination messages with general group messages in different proportions, aiming to understand how exposure to biased data influences text generation and introduces harmful biases. The experiments show how the frequency and intensity of anti-vaccination content generated by the model change, and they elucidate the broader implications for reliability and ethics when using language models in sensitive applications. This study provides social scientists with a tool to explore and understand the complexities and challenges of misinformation in public health, particularly vaccine misinformation, through the use of language models.
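The abstract describes mixing anti-vaccination messages with general group messages in different proportions but does not say how the mixtures were built. The following is a minimal sketch of one way such a mixed fine-tuning corpus could be assembled. The function name `build_poisoned_corpus`, its parameters, the example proportions, and the placeholder messages are all hypothetical and are not taken from the thesis.

```python
import random


def build_poisoned_corpus(general, biased, poison_fraction, size, seed=0):
    """Sample `size` messages without replacement, drawing a share of
    `poison_fraction` from `biased` and the rest from `general`.

    Hypothetical helper for illustration only; not the author's method.
    """
    if not 0.0 <= poison_fraction <= 1.0:
        raise ValueError("poison_fraction must be in [0, 1]")

    # Number of biased messages, rounded with Python's built-in round().
    n_biased = round(size * poison_fraction)
    n_general = size - n_biased
    if n_biased > len(biased) or n_general > len(general):
        raise ValueError("not enough messages to sample without replacement")

    # A seeded generator keeps each mixture reproducible across runs.
    rng = random.Random(seed)
    corpus = rng.sample(biased, n_biased) + rng.sample(general, n_general)
    rng.shuffle(corpus)  # avoid presenting all biased messages first
    return corpus


# Placeholder data standing in for real message collections (assumption).
general_msgs = [f"general-{i}" for i in range(200)]
biased_msgs = [f"biased-{i}" for i in range(50)]

# Assumed example proportions; the thesis abstract does not list its own.
mixtures = {
    fraction: build_poisoned_corpus(general_msgs, biased_msgs, fraction, size=40)
    for fraction in (0.0, 0.25, 0.5, 1.0)
}
```

Each mixture could then be passed to whatever fine-tuning pipeline is in use, so that the generated text can be compared across poisoning levels. Sampling without replacement is a design choice made here to keep duplicates from inflating the effective share of biased text.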


Browsing Mestrado em Informática by Author "Almeida Junior, Jurandy Gomes de"
