5/25/23
In 1990, Microsoft Excel 3 introduced the ability to create 3D charts (Walkenbach 2020). Multiple studies have shown the downsides to unnecessarily using these types of charts.
2D graphs had more confident answers and novice participants were more accurate with 2D graphs than 3D graphs (Barfield and Robless 1989)
Visual appeal for 2D and 3D graphs were approximately equal, and simple graphs were preferred for extracting information (Fisher, Dempsey, and Marousky 1997)
Problems with 3D graphs
When projected on a 2D surface, the data is distorted
When the third dimension includes data, the viewing angle can skew perception
Appropriate uses for 3D graphs
Viewing angle is not a concern when users can interact with the visualization
Mapping data onto 3D objects
Research questions:
Our study draws influence from Cleveland and McGill’s 1984 paper, specifically from their position-length experiment with graph types 1 and 3 (Cleveland and McGill 1984).
Overall goal was to establish which graphical elements are the most important for extracting information
Types 1-3 measure position, and types 4-5 measure length
Graph types for position-length experiment from Cleveland and McGill
In their study, participants were asked which of the marked bars were smaller and by approximately how much.
Values involved in comparison judgments are
\[V_i=10\cdot10^{(i-1)/12}\quad i=1,\dots, 10\]
To closely replicate values, we made three assumptions about the comparisons from Cleveland and McGill
Match the exact ratios that were used
No value was used more than twice
All other values are randomly generated
We designed our study to be given to members of our statistics department and their adult roommates/partners.
Treatment Design
There are 42 treatment combinations, too many for a busy subject!
Design of each kit
Each subject receives a kit of 3D printed charts with instructions that direct users to a Shiny app
\[y_{ijklm}=\mu+S_i+R_j+G(R)_{(k)j}+T_l+\epsilon_{ijklm}\]
where
\(y_{ijklm}=\log_2(|\text{Judged Percent} - \text{True Percent}|+1/8)\)
\(S_i\sim N(0,\sigma^2_S)\) is the effect of the \(i^{th}\) subject
\(R_j\) is the effect of the \(j^{th}\) ratio
\(G(R)_{(k)j}\) is the effect of the \(k^{th}\) graph type nested in the \(j^{th}\) ratio
\(T_l\) is the effect of the \(l^{th}\) comparison type
\(\epsilon_{ijklm}\sim N(0,\sigma^2_\epsilon)\) is the random error
| Term | SS | MS | Num DF | Den DF | F-value | P-value |
|---|---|---|---|---|---|---|
| ratio | 0.276 | 0.276 | 1 | 497.551 | 0.160 | 0.689 |
| type | 2.593 | 2.593 | 1 | 505.408 | 1.508 | 0.220 |
| ratio:plot | 0.610 | 0.305 | 2 | 493.359 | 0.178 | 0.837 |
In Cleveland and McGill, confidence intervals were calculated using bootstrap sampling of subjects using the means of the midmeans. The same process is used here, although each subject did not receive each treatment combination.
The responses from the previous analysis used the log errors, but Generalized Linear Mixed Models are a modern option to analyze the data. Adjusting for the ratios as covariates, a beta regression model with unequal slopes is fit below:
\[\eta_{ijkl}=S_i+\beta_j R_{jk}+G_k+T_l\]
\(\eta_{ijkl}\) is the participant response
\(S_i\sim N(0,\sigma^2_S)\) is the effect of the \(i^{th}\) subject
\(\beta_j\) is the effect of the \(j^{th}\) true ratio
\(R_{jk}\) is the \(j^{th}\) ratio for the \(k^{th}\) graph type
\(T_l\) is the effect of the \(l^{th}\) comparison type
\(y_i|S_i\sim Beta(\mu_{ijkl}, \phi)\)
Link function: \(\eta_{ijkl}=\log[\mu_{ijkl}/(1-\mu_{ijkl})]\)
| plot | _plot | Estimate | Standard Error | DF | t-Value | Adj P | Odds Ratio | Adj Lower Odds Ratio | Adj Upper Odds Ratio |
|---|---|---|---|---|---|---|---|---|---|
| 2dDigital | 3dDigital | 0.21 | 0.06263 | 491 | 3.35 | 0.0025 | 1.234 | 1.065 | 1.429 |
| 2dDigital | 3dPrint | 0.07093 | 0.06164 | 491 | 1.15 | 0.4834 | 1.074 | 0.929 | 1.241 |
| 3dDigital | 3dPrint | -0.1391 | 0.06185 | 491 | -2.25 | 0.0642 | 0.87 | 0.752 | 1.006 |
Beta regression does not have easily interpretable results. These results are averaged over all ratios. Generally, the expected proportion responses follow that 2D digital > 3D printed > 3D digital
Results averaged over ratio, holding the comparison type constant
No results were significant, but what if we could incorporate the study at a larger scale?
Experiential learning with STAT 218 at University of Nebraska-Lincoln.