Data Science 2

A group of problems that help you to build an accessible box plot from data loaded in from a spreadsheet.

Using Chart Libraries

Several chart types can be found in Quorums' libraries. This includes bar charts, box plots, histograms, scatter plots, and others. To program a chart you need a use block, a DataFrame with loaded data, a chart object, and the Display action block.

In this problem create a pie chart that displays the species frequency from the penguins.csv data set. First, add the use blocks for a pie chart and data frame. Then, create the data frame object and load the dataset. Finally, run the program to generate and display an accessible pie chart. Also, be sure to stop this type of program when you are done with the chart.

Selecting a Factor

Factors are the independent variables you are charting that can be either a column of text or number values. In this problem, try out each of the selected factor blocks to see each pie cart that is generated. If you need a reminder of the header names and data, look over the penguins.csv dataset.

Selecting Columns

Columns are the dependant variables you are charting. They are the columns of number values in a dataset. In this problem, try out each of the selected column blocks to see each bar cart that is generated. If you need a reminder of the header names and data, look over the penguins.csv dataset.

Creating Bar Charts

In this program, try out each of the bar chart generation options and consider the meaning each chart conveys. These bar chart options include, bar chart values that represent the minimum value, maximum value, mean of the values, and sum of the values. If you need a reminder of the header names and data, look over the penguins.csv dataset.

Creating Histograms

In this program, try out each of the histogram generation options and consider the meaning each chart conveys. These histogram options include a default histogram shape or a histogram with a bin width that you set. If you need a reminder of the header names and data, look over the penguins.csv dataset.

Creating Scatter Plots

Make two scatter plots. One scatter plot should select two columns and generate the scatter plot. The other should select two columns, one factor, and then generate the scatter plot. If you need a reminder of the header names and data, look over the penguins.csv dataset.

Creating Box Plots

Make two box plots. One box plot should select one column and generate the box plot. The other should select one column(bill_depth_mm), one factor(species), and then generate the box plot. If you need a reminder of the header names and data, look over the penguins.csv dataset.

Customizing Box Plots

Make and analyze two box plots. One box plot should select two columns(bill_depth_mm, bill_length_mm) and show outliers. The other should select two columns (bill_depth_mm, bill_length_mm) and hide outliers. If you need a reminder of the header names and data, look over the penguins.csv dataset.

Chart Titles

In this program, you will customize the box plot so that the title, subtitle, x-axis label, and y-axis label are more meaningful than the default. Begin by running the program and looking at the default titles. Then stop the program and changes each title before running the program again.

More Chart Customization

There are many options for customizing different types of charts. In this program, explore some of these options in the tray. You may decide to hide or show outliers, patterns, and the legend. You might even choose to change the position of the legend or flip the chart direction.

Make a Chart

Use what you learned about charts to make a custom chart using the penguins.csv dataset. Add and edit the blocks in the tray to create a chart that visualizes the data in a way that is interesting to you.