{"title":"Visual analytics for monitoring credit scoring models","authors":"Daiane Rodrigues Baldo, Murilo Santos Regio, Isabel Harb Manssour","doi":"10.1177/14738716231180803","DOIUrl":"https://doi.org/10.1177/14738716231180803","url":null,"abstract":"Financial institutions use credit Scoring models to predict the default of their customers and assist in decision-making about the granting of credit. As a large volume of credit transactions is generated daily alongside a potential increase in this information with the advent of Open Finance, it is challenging to monitor this information quickly so we can act in case these models lose performance. Considering this context, our research aims to provide a Visual Analytics approach to assist in monitoring credit models. For this, initially, we carried out a systematic review of the literature on the subject and conducted semi-structured interviews with 13 domain experts. Considering the needs raised with this study, we created a prototype called Visual Analytics for monitoring Credit Scoring models (VACS). The main contributions of this work are twofold: The requirements gathered through interviews with specialists, which allowed the analysis of how the models are monitored within financial institutions, something that is not disclosed and that can help in the standardization of the monitoring process; and VACS, which was evaluated by four domain experts who considered it a very complete and easy-to-use tool.","PeriodicalId":50360,"journal":{"name":"Information Visualization","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135672678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Waffster: Hierarchical waffle charts for budget visualization","authors":"Antoine Béland, Florent Daudens, Thomas Hurtut","doi":"10.1177/14738716231173730","DOIUrl":"https://doi.org/10.1177/14738716231173730","url":null,"abstract":"Understanding and consuming public budget data is a key issue, helping citizens in gaining insight into their democratic and political systems. The goal of this work is to present Waffster, a user-friendly representation supporting the understanding of such data. The proposed representation enables the browsing, searching, comparing, and presenting of the hierarchically arranged components and quantities in budgets. In this paper, we first conduct a thorough survey of online public budget visualizations. Then, in collaboration with Le Devoir, a Canadian daily newspaper, we propose a novel unit-based hierarchical design based on waffle charts. We evaluate this design using a controlled user study to compare it to a tree-map based layout, and a case study conducted with Le Devoir during the provincial election campaign in Québec of 2018.","PeriodicalId":50360,"journal":{"name":"Information Visualization","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135643711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparing overlapping data distributions using visualization","authors":"Eric Newburger, Niklas Elmqvist","doi":"10.1177/14738716231173731","DOIUrl":"https://doi.org/10.1177/14738716231173731","url":null,"abstract":"We present results from a preregistered and crowdsourced user study where we asked members of the general population to determine whether two samples represented using different forms of data visualizations are drawn from the same or different populations. Such a task reduces to assessing whether the overlap between the two visualized samples is large enough to suggest similar or different origins. When using idealized normal curves fitted on the samples, it is essentially a graphical formulation of the classic Student’s t-test. However, we speculate that using more sophisticated visual representations, such as bar histograms, Wilkinson dot plots, strip plots, or Tukey boxplots will both allow people to be more accurate at this task as well as better understand its meaning. In other words, the purpose of our study is to explore which visualization best scaffolds novices in making graphical inferences about data. However, our results indicate that the more abstracted idealized bell curve representation of the task yields more accuracy.","PeriodicalId":50360,"journal":{"name":"Information Visualization","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135184672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Archit Rathore, Yichu Zhou, Vivek Srikumar, Bei Wang
{"title":"TopoBERT: Exploring the topology of fine-tuned word representations","authors":"Archit Rathore, Yichu Zhou, Vivek Srikumar, Bei Wang","doi":"10.1177/14738716231168671","DOIUrl":"https://doi.org/10.1177/14738716231168671","url":null,"abstract":"Transformer-based language models such as BERT and its variants have found widespread use in natural language processing (NLP). A common way of using these models is to fine-tune them to improve their performance on a specific task. However, it is currently unclear how the fine-tuning process affects the underlying structure of the word embeddings from these models. We present TopoBERT, a visual analytics system for interactively exploring the fine-tuning process of various transformer-based models – across multiple fine-tuning batch updates, subsequent layers of the model, and different NLP tasks – from a topological perspective. The system uses the mapper algorithm from topological data analysis (TDA) to generate a graph that approximates the shape of a model’s embedding space for an input dataset. TopoBERT enables its users (e.g. experts in NLP and linguistics) to (1) interactively explore the fine-tuning process across different model-task pairs, (2) visualize the shape of embedding spaces at multiple scales and layers, and (3) connect linguistic and contextual information about the input dataset with the topology of the embedding space. Using TopoBERT, we provide various use cases to exemplify its applications in exploring fine-tuned word embeddings. We further demonstrate the utility of TopoBERT, which enables users to generate insights about the fine-tuning process and provides support for empirical validation of these insights.","PeriodicalId":50360,"journal":{"name":"Information Visualization","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42372953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Voronoi treemap in Manhattan distance and Chebyshev distance","authors":"Yan Chao Wang, Yi Xing, J. Zhang","doi":"10.1177/14738716231167181","DOIUrl":"https://doi.org/10.1177/14738716231167181","url":null,"abstract":"The ordinary Voronoi treemap generated based on the Euclidean distance function has the flexibility to slightly adjust the layout when visualizing time-varying hierarchical data for better visual quality, preserving neighborhood relationships, and preserving a stable layout. However, its layout formed by segments with arbitrary orientations has poor shape stability between successive layouts at different time indexes, which is not conducive for the users to understand the plot and track the same node. In this paper, we propose novel Voronoi treemaps in Manhattan distance and Chebyshev distance respectively, such that the segments in the new layouts only have four orientations (horizontal, vertical, and ±45° to the x -axis). The new layouts can not only heritage the abilities of ordinary Voronoi treemap, but preserve good shape stability. To achieve this, we first focus on the weighted bisector between two sites in Manhattan distance and design a bisector generation method for different weight values of two sites, as the bisector is the foundation to form a diagram. Then a divide-and-conquer method is utilized to form the bisectors into a Voronoi diagram, and a Voronoi treemap layout can be finally obtained by using Lloyd’s method to iteratively adjust the diagram. Moreover, we prove that the treemap algorithm in Manhattan distance can be adjusted to also generate the Voronoi treemap in Chebyshev distance via linear transformation, avoiding designing additional algorithm. The computational properties of the proposed methods are first evaluated to check whether the layouts can be generated fast and accurately. Then the perceptual properties are evaluated quantitatively and qualitatively based on quality metrics and user studies, respectively. The results demonstrate that the proposed Voronoi treemaps preserve similar layout stability, but better visual quality and shape stability than the ordinary Voronoi treemap. Our algorithms are simple and resolution-independent, but also provide alternatives to the Voronoi treemaps.","PeriodicalId":50360,"journal":{"name":"Information Visualization","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2023-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45232514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The role of individual characteristics: How thinking style and domain expertise affect performances on visualization","authors":"S. Tomasi, Jeanny Liu, Feng Cheng, Chaodong Han","doi":"10.1177/14738716231167180","DOIUrl":"https://doi.org/10.1177/14738716231167180","url":null,"abstract":"Widely employed by innovative organizations, a well-designed simple data visualization has been shown to enhance user experience and aid in decision making; while a more embellished visualization may cause overload, it has the potential to create deeper processing and learning. Furthermore, individual characteristics may impact on how users seek information on these different types of visualization. This study proposes that thinking styles (analytical vs holistic) and domain expertise moderate the effects of data visualization types on decision performances in terms of decision accuracy, decision confidence, memory recall, and cognitive load. To test our hypotheses, an experimental study involving visual manipulations in the context of personal finance was conducted on two types of visualizations (simple and clutter). Results suggest that simple visualizations enhance decision accuracy and reduce cognitive load. We also find that cognitive load is further reduced when analytical thinkers are presented with simple visualizations. These findings can help designers understand how user characteristics may be considered when designing and evaluating visualizations for decision makers.","PeriodicalId":50360,"journal":{"name":"Information Visualization","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2023-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48112104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
L. Stuart, Christopher Haynes, K. Tantam, R. Gardner, Marco A. Palomino
{"title":"Visualizing the recovery of patients in Critical Care Units","authors":"L. Stuart, Christopher Haynes, K. Tantam, R. Gardner, Marco A. Palomino","doi":"10.1177/14738716231158046","DOIUrl":"https://doi.org/10.1177/14738716231158046","url":null,"abstract":"This paper presents a detailed case study of the application of techniques from Information Visualization to data collected in Critical Care Units (CCUs). This data is heterogeneous and sometimes incomplete due to the pressures on staff in the environment. Thus, it can be difficult to use conventional means to visualize it meaningfully. The paper presents the software tool called CCViews. It was developed to support visualization of CCU data. It enables clinicians to view the trajectory of patient recovery and track the effectiveness of different interventions such as physiotherapy. Note that this work is underpinned by the world-famous information seeking mantra, which emphasizes the need to provide users with views of their data at differing levels of granularity.","PeriodicalId":50360,"journal":{"name":"Information Visualization","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2023-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41719207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Debanjan Datta, Nathan Self, J. Simeone, A. Meadows, Willow Outhwaite, Linda Walker, N. Elmqvist, Naren Ramkrishnan
{"title":"TimberSleuth: Visual anomaly detection with human feedback for mitigating the illegal timber trade","authors":"Debanjan Datta, Nathan Self, J. Simeone, A. Meadows, Willow Outhwaite, Linda Walker, N. Elmqvist, Naren Ramkrishnan","doi":"10.1177/14738716231157081","DOIUrl":"https://doi.org/10.1177/14738716231157081","url":null,"abstract":"Detecting illegal shipments in the global timber trade poses a massive challenge to enforcement agencies. The massive volume and complexity of timber shipments and obfuscations within international trade data, intentional or not, necessitates an automated system to aid in detecting specific shipments that potentially contain illegally harvested wood. To address these requirements we build a novel human-in-the-loop visual analytics system called TIMBERSLEUTH. TimberSleuth uses a novel scoring model reinforced through human feedback to improve upon the relevance of the results of the system while using an off-the-shelf anomaly detection model. Detailed evaluation is performed using real data with synthetic anomalies to test the machine intelligence that drives the system. We design interactive visualizations to enable analysis of pertinent details of anomalous trade records so that analysts can determine if a record is relevant and provide iterative feedback. This feedback is utilized by the machine learning model to improve the precision of the output.","PeriodicalId":50360,"journal":{"name":"Information Visualization","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47208835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Is embodied interaction beneficial? A study on navigating network visualizations","authors":"Helen H. Huang, Hanspeter Pfister, Yalong Yang","doi":"10.1177/14738716231157082","DOIUrl":"https://doi.org/10.1177/14738716231157082","url":null,"abstract":"Network visualizations are commonly used to analyze relationships in various contexts, such as social, biological, and geographical interactions. To efficiently explore a network visualization, the user needs to quickly navigate to different parts of the network and analyze local details. Recent advancements in display and interaction technologies inspire new visions for improved visualization and interaction design. Past research into network design has identified some key benefits to visualizing networks in 3D versus 2D. However, little work has been done to study the impact of varying levels of embodied interaction on network analysis. We present a controlled user study that compared four network visualization environments featuring conditions and hardware that leveraged different amounts of embodiment and visual perception ranging from a 2D visualization desktop environment with a standard mouse to a 3D visualization virtual reality environment. We measured the accuracy, speed, perceived workload, and preferences of 20 participants as they completed three network analytic tasks, each of which required unique navigation and substantial effort to complete. For the task that required participants to iterate over the entire visualization rather than focus on a specific area, we found that participants were more accurate using a VR HMD and a trackball mouse than conventional desktop settings. From a workload perspective, VR was generally considered the least mentally demanding and least frustrating to use in two of our three tasks. It was also preferred and ranked as the most effective and visually appealing condition overall. However, using VR to compare two side-by-side networks was difficult, and it was similar to or slower than other conditions in two of the three tasks. Overall, the accuracy and workload advantages of conditions with greater embodiment in specific tasks suggest promising opportunities to create more effective environments in which to analyze network visualizations.","PeriodicalId":50360,"journal":{"name":"Information Visualization","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2023-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47670991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Providing visual analytics guidance through decision support","authors":"Wenkai Han, Hans-Jörg Schulz","doi":"10.1177/14738716221147289","DOIUrl":"https://doi.org/10.1177/14738716221147289","url":null,"abstract":"Guidance in visual analytics aims to support users in accomplishing their analytical goals and generating insights. Different approaches for guidance are widely adopted in many tools and frameworks for various purposes – from helping to focus on relevant data subspaces to selecting suitable visualization techniques. With each of these different purposes come specific considerations on how to provide the needed guidance. In this paper, we propose a generic method for making these considerations by framing the guidance problem as a decision problem and applying decision making theory and models toward its solution. This method passes through three stages: (1) identifying decision points; (2) deriving and evaluating alternatives; (3) visualizing the resulting alternatives to support users in comparing them and making their choice. Our method is realized as a set of practical worksheets and illustrated by applying it to a use case of providing guidance among different clustering methods. Finally, we compare our method with existing guidance frameworks to relate and delineate the respective goals and contributions of each.","PeriodicalId":50360,"journal":{"name":"Information Visualization","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43349249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}