Scibib

Visual Comparison of Text Sequences Generated by Large Language Models

R. Sevastjanova, S. Vogelbacher, A. Spitz, D. A. Keim, M. El-Assady

The ninth Symposium on Visualization in Data Science (VDS), 2023
Explainability Language Modeling Explainable Artificial Intelligence

Abstract

Causal language models have emerged as the leading technology for automating text generation tasks. Although these models tend to produce outputs that resemble human writing, they still suffer from quality issues (e.g., social biases). Researchers typically use automatic analysis methods to evaluate the model limitations, such as statistics on stereotypical words. Since different types of issues are embedded in the model parameters, the development of automated methods that capture all relevant aspects remains a challenge. To tackle this challenge, we propose a visual analytics approach that supports the exploratory analysis of text sequences generated by causal language models. Our approach enables users to specify starting prompts and effectively groups the resulting text sequences. To this end, we leverage a unified, ontology-driven embedding space, serving as a shared foundation for the thematic concepts present in the generated text sequences. Visual summaries provide insights into various levels of granularity within the generated data. Among others, we propose a novel comparison visualization that slices the embedding space and represents the differences between two prompt outputs in a radial layout. We demonstrate the effectiveness of our approach through case studies, showcasing its potential to reveal model biases and other quality issues.

Materials

Related Publication

R. Sevastjanova, H. Hauptmann, S. Deterding, M. El-Assady,

Personalized Language Model Selection through Gamified Elicitation of Contrastive Concept Preferences

IEEE Transactions on Visualization and Computer Graphics, 2023

R. Sevastjanova, M. El-Assady,

WEC-Explainer: A Descriptive Framework for Exploring Word Embedding Contextualization

Exploring Research Opportunities for Natural Language, Text, and Data Visualization (NLVIZ) Workshop at IEEE VIS, 2023

R. Sevastjanova, A.-L. Kalouli, C. Beck, H. Hauptmann, M. El-Assady

LMFingerprints: Visual Explanations of Language Model Embedding Spaces through Layerwise Contextualization Scores

Eurographics Conf. on Visualization (EuroVis), 2022

R. Sevastjanova, M. El-Assady,

Beware the Rationalization Trap! When Language Model Explainability Diverges from our Mental Models of Language

Communication in Human-AI Interaction Workshop at IJCAI-ECAI'22, 2022

R. Sevastjanova, E. Cakmak, S. Ravfogel, R. Cotterell, M. El-Assady

Visual Comparison of Language Model Adaptation

IEEE Trans. on Visualization and Computer Graphics, 2022