The s-CorrPlot is an interactive scatterplot for visually exploring pairwise correlation coefficients between variables in large datasets. Variables are projected as points on a scatterplot with respect to some user-selected variables of interest, driven by a geometric interpretation of correlation. The correlation of all other variables to the selected one is indicated by vertical gridlines in the plot. By selecting new variables of interest, a user can create simple tours of the correlation space through animations between different projections of the data. For more details, see our companion paper.
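The geometric interpretation mentioned above can be illustrated in plain R. This is a minimal sketch of the general idea, not the scorr implementation: it assumes each variable is centered and scaled to unit length, so that the dot product of two variables equals their Pearson correlation. The toy data and the names `X`, `Z`, `unit`, `p`, `s`, and `coords` are hypothetical and chosen only for illustration.

```r
set.seed(1)                            # reproducible toy data
n <- 50                                # observations per variable
X <- matrix(rnorm(n * 6), nrow = n)    # 6 hypothetical variables as columns
colnames(X) <- paste0("v", 1:6)

# Center each variable and scale it to unit length, so that the dot
# product of two columns equals their Pearson correlation.
unit <- function(v) {
  v <- v - mean(v)
  v / sqrt(sum(v^2))
}
Z <- apply(X, 2, unit)

p <- Z[, "v1"]                         # primary variable of interest
s <- Z[, "v2"]                         # secondary variable of interest

# Orthogonalize the secondary direction against the primary (Gram-Schmidt).
s_orth <- s - sum(s * p) * p
s_orth <- s_orth / sqrt(sum(s_orth^2))

# 2D coordinates of every variable in the plane spanned by p and s.
coords <- cbind(x = drop(crossprod(Z, p)),
                y = drop(crossprod(Z, s_orth)))

# The x-coordinate agrees with the correlation to the primary variable.
all.equal(unname(coords[, "x"]), unname(cor(X)[, "v1"]))
```

Under these assumptions, every variable lands inside the unit circle and its horizontal position is its correlation with the primary variable, which is consistent with reading correlations off vertical gridlines.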
R
install.packages("devtools")
devtools::install_github("mckennapsean/scorrplot")
Once installed, the scorr R package includes datasets and demos to help you get started. The demos are listed below by name, each showing correlations between the given variables.
- gene1 : genes across different mice in two brain regions
- gene2 : genes across different mice in nine brain regions
- image : image patches across two different image datasets
- random : random variables, illustrating how to build a data frame for scorr
- subwayStops : subway stop ridership in different regions of NYC
- subwayTime : subway ridership across the years in NYC
To run any of the demos above, open a terminal window and type the following, replacing random with the demo name you want:
R
library(scorr)
demo(random)
# press ENTER to continue the demo
Within the tool, you can interactively define new projections of the data by selecting new variables of interest in the plot or by selecting other data-driven projections, like the principal component bar charts. Animated transitions show the reorientation of the projection plane to these new selections. You can select primary and secondary variables of interest by left and right click, respectively.
There are also various keyboard commands within the tool, such as left and right arrow to adjust transparency of points in the s-CorrPlot. Furthermore, there are various queries from R that can pass data to and from the tool. For more details, please consult the package documentation.
The s-CorrPlot is integrated within the R statistical framework for data input and output, but the interactive tool is largely written in C++ and OpenGL. It has been tested on Mac OS X and Linux and will only run from a terminal window.
To compile the scorr R-package, you need the following components:
- R (version 2.10 or newer) + devtools
- Xcode (for Mac, install Xcode and download command line tools)
- OpenGL + GLUT (for Linux, GLUT library like freeglut)
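As a quick sanity check of the R-side requirements, the following sketch uses only base R functions. It does not check for Xcode, OpenGL, or GLUT, which have to be verified through your operating system; the variable names `r_ok` and `devtools_ok` are illustrative.

```r
# Compare the running R version against the minimum stated above.
r_ok <- getRversion() >= "2.10.0"

# TRUE if the devtools package can be loaded (without attaching it).
devtools_ok <- requireNamespace("devtools", quietly = TRUE)

message("R >= 2.10: ", r_ok, "; devtools available: ", devtools_ok)
```

If `devtools_ok` is FALSE, installing devtools from CRAN with `install.packages("devtools")` should resolve it.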
To contribute to package development, please ensure you have a development environment that compiles the package. Then, download the code from GitHub into your own local folder.
From that local folder, enter the following commands in your terminal to re-compile and test the package with any edits you make to the source:
R
devtools::load_all()
You can then test the scorr tool inside R, for example:
data(gene1)
scorr(data)
Feel free to submit any issues or pull requests you have to us!
To uninstall, simply remove the scorr package:
R
remove.packages("scorr")
This project's code is licensed under GPLv2.
The scorr package was originally developed as the tool gyroscope, written by Samuel Gerber.
If you have any difficulties or questions, please contact [email protected].