DO-IT Summer Study
Quorum Background (5 Minutes)
The Quorum Programming Language is developed at the University of Nevada, Las Vegas. It was largely created with two goals in mind: 1) Make programming accessible for people with disabilities and 2) Use evidence-based decisions about design and syntax choices to make the language easier to use.The language supports a variety of applications, such as lego robotics, audio processing, 2D and 3D game development, as well as many more. The current work being done is the addition of data science tools that are fully accessible.
Introduction to Charts (5 Minutes)
When presenting data to others, our job is to translate our findings into a representation that makes the data easier to grasp. One common approach to do this is to present charts, which contain a visual component and can also be accessible with some care. Presenting our data as a chart allows us to quickly explain our analysis and draw conclusions from datasets we have collected without needing to present complex academic statistical analysis. In a sense, charts let us get the gist of what our data is telling us, supposing the chart is fairly presented.
Let us imagine what it would be like if we had no way to present our data beyond the numbers themselves. We would have millions of values to explain and we would not be able to showcase trends easily that make sense to others. Charts can help us present our information better and help us remember our data versus examining a dataset. We can discuss how data charts are represented all throughout education and industries, providing examples of each.
How Charts Relate to Data Science (5 Minutes)
Charts are important outside of academic settings, where many data scientists use data visualization to represent the information found on datasets. Since data in real life often involves large amounts of data, it's more important than ever that we can actually condense that information into something more meaningful.
Showing the importance of graphical charts in data science allows us to identify and understand massive amounts of information. In Quorum, we have seven different chart types to represent data: Bar Charts, Pie Charts, Box Plots, Histograms, Violin Plots, Line Charts, and Scatter Plots. We will be learning how to use a few of these charts today.
Chart Types (5 Minutes)
Different charts are used to convey different types of information, meaning, there is no single chart we can use to represent all data. Today, we're going to focus on just two of these chart types: Histograms and Scatter Plots.
Histograms
A very common chart for visualizing numeric data is through a histogram. Histograms are typically used to represent the distribution of numerical data. The variable we observe is divided into different intervals where as a whole, we can examine the shape of the graph whether it is skewed, normal, uniform, bimodal [two distinct curves], etc. and draw conclusions about the data. Typical datasets to look out for when wanting to construct a histogram would be datasets featuring costs, ages, GPAs, and test scores. Note that datasets with non-numeric data are not recommended for histograms.
Scatter Plots
The next chart we will be learning to create would be a scatter plot. Scatter plots are used to observe relationships between two variables. An example of this would be comparing the heights and diameters of trees, where the position of each dot correlates to that specified height and diameter. The data as a whole can represent a relationship: strong positive/negative linear, moderate positive/negative linear, or no relationship. As a data scientist, one of the most important patterns is seeing how points cluster from other points, if there are any gaps within the dataset, and identifying outliers for within the set. We focus on these aspects to make predictions on future datasets and understand trends.
Accessible Chart Navigation (10 Minutes)
Histogram
Scatter Plot
Additional Resources
Core Track: A good place to start for beginner programmers (not data science specific). This learning track will teach you the basics of types/variables, control structures, actions and classes.Data Science Professional Development Track: This learning track will teach you about data science programming in Quorum. It takes you from start to finish with making charts.Quorum Reference Pages: This site provides documentation on all of the pieces of Quorum. Some relevant sections to the data science content include Data Science, Data Transformations, Descriptive Statistics and Charts.
Next: Activity
In the next section, we will work through an Activity as a group.