Quantitative Skills

Variables

Experimental science looks at cause and effect types of relationships. Controlled experiments vary one of the factors or traits  to observe the effect on another factor or trait. These factors are called variables. A dependent variable is something that is observed and expected to change as a result of modifying another factor in the experiment. That is to say, the outcome depends on another factor.  Another name for dependent variable is responding variable. The independent variable is the factor or condition that is changing or being changed by the experimenter. Sometimes waiting is the the condition that is changing, making the independent variable : time. Since we change the independent variable, it is also called the manipulated variable.

Graphing a line

Linear Function Graph

A graph displaying 2 lines and their equations

 A line can be described mathematically by the equation:

y= mx + b

This is referred to as the slope-intercept form. The 2 variables y and x refer to coordinates along each axis. The term m refers to the change in the y-values over the change in the x-values. This is referred to as the slope of the line. The term b, the y-intercept, is the y-value where the line crosses the y-axis.

m = \frac{y_2-y_1}{x_2-x_1} OR m = \frac{\Delta y}{\Delta x}
This is how the slope of a line (m) is determined.

The slope of the line indicates the relationship between the two variables, x and y. The equation of the line y= mx + b indicates to us that “y” occurs as a function of changes to “x”. Sometimes this is represented by the equation f(x)= mx + b . Since “y” depends on “x”, “y is the dependent variable and “x” is the independent variable.  As “x” changes, how does “y” change in response? This is what the slope reveals to us.

For more review, visit the following link.

Slope activity

Graphing Lines

Click on image to run the simulation

 Data Plotting Activity

  1. Use the following data from the USDA to plot a scatterplot and generate a trend-line.
  2. Use the info from the table below or download (a text file that can be opened in notepad/textedit)
  3. follow a tutorial on how to graph a scatterplot with line of best fit (trend-line)
  4. or use this tutorial in plot.ly
      • remember NOT to make a line graph
      • Ensure that “Year” is the x-axis
      • Ensure that “lbs. Mozzarella” is the y-axis
      • Show the equation of the trend-line on the graph
Year lbs. Mozzarella
2000 9.33
2001 9.70
2002 9.66
2003 9.65
2004 9.94
2005 10.19
2006 10.52
2007 11.02
2008 10.57
2009 10.63

What do we learn?

  1. What does this scatterplot tell us about the relationship between consumption of mozzarella in relationship to years?
  2. How would this graph influence the way you invest in a mozzarella cheese company? Can you predict anything about the future of cheese consumption?
  3. What does the slope of this line indicate to you?
    1. Use mathematics to illustrate this point.
    2. The slope has a unit related to “lbs.” and “year”, what is this unit?

Creating a Line of Best Fit

Not all points collected will fall on a straight line. A Line of Best Fit or a Trendline approximates the average of those points through a mathematical process called the least squares method. While one could “eyeball” this line, the least squares method uses the data to minimize the distance from all those points to the line to have an averaging effect.

  1. Create a column of data for “x” values, “y” values, x2 and xy
  2. At the bottom of these columns, sum the data. Σx, Σy, Σx2, Σxy
  3. Calculate the slope from these values
    • m = \frac{N (\Sigma xy)- \Sigma x \Sigma y}{N (\Sigma x^2)-(\Sigma x)^2} (where N=number of entries in a column)
  4. Calculate the y-intercept from these values
    • b = \frac{N(\Sigma y) - m (\Sigma x)}{N}
  5. Which provides use the the function f(x)= mx + b
Least-Squares Regression

Click on image to run the simulation