To Notes

IT 223 -- Jan 12, 2026

Review Problems

  1. Which R functions do you know? Answer:
    print  cat  c  seq  mean  median  IQR  
    summary  quantile  stem  hist  boxplot
    
  2. Compute Q1, Q3, IQR for this sorted dataset, using the Tukey's Hinges method.
    13  27  35  39  53  121  983 
    
    Answer: Q2 is the middle number 39, Q1 is the median of the bottom half of the data 31, Q3 is the median of the top half of the data 87. Recall that if n, the number of observations, is odd, that the middle number is included in both the bottom half and the top half, so
        Q1 = (27 + 35) / 2 = 31
    and
        Q3 = (53 + 121) / 2 = 87.
    Also
        IQR = Q3 - Q1 = 87 - 31 = 56.
  3. What information is contained in a boxplot?
    Answer: Q0, Q1, Q2, Q3, Q4, and any outliers, which are below the lower inner fence or above the upper inner fence.
  4. What does the term inner fence mean?
    Ans: The two inner fences are located at IF1 = Q1 - 1.5 * IQR and IF2 = Q3 + 1.5 * IQR.  Outliers are points which are less than IF1 or greater than IF2.
  5. What does it mean for two quantities to be linearly proportional? Give some examples. Answer:
    Example 1: Baking Bread. Define these quantities in cups:
    s1 = amount of sugar for original recipe
    s2 = amount of sugar for doubled recipe
    f1 = amount of flour for original recipe
    r2 = amount of flour for doubled recipe
    The amount of sugar and flour are proportional in the original and doubled recipes, which means
    s2 / s1 = f2 / f2.
    Example 2: Show that time and distance traveled are proportional when driving at a constant speed.
    Answer: show that t2 / t1 = d2 / d1.
    When driving at a constant speed, distance = time * velocity, so
    t2 / t1 = d2 / d1
    t2 / t1 = (v * t2) / (v * t1)
    The v in the numerator cancels with the v in the denominator, which makes the two sides of the equation equal.
  6. Given two rectangles that are the same height, show that the areas of the rectangles are proportional to their heights, using the fact that area = base * height or A = b * h for a rectangle. Answer:
    A2 / A1 = b2 / b1 = (h * b2) / (h * b1) = b2 / b1,
    where we cancel the h in the numerator with the h in the denominator.
  7. Compute the medians of the histograms from tables b and c for Problem 1 in More Practice Problems at the bottom of the Jan 7 Notes. The answer is shown at the end of this problem.
  8. Compute Q1 (25th percentile) and Q3 (75th percentile) of Table c from the previous problem.
    Answer: The calculations for Q1 and Q3.
  9. Draw the histogram without bar lines of
    1. (i) the incomes of all persons in the U. S.
      Answer: A histogram skewed to the right with a peak at about 65 thousand, but with a long right tail that extends all the way past 1 billion
    2. (ii) the GPAs of all students at DePaul. Answer: A bell-shaped histogram with peak around 3.0. There may be a secondary peak around 2.0, representing those students that have just come off of academic probation. There are very few students with GPA < 1.0 or GPA close to 4.0. Most GPAs will be between 30 and 3.5. The height of the histogram can only be nonzero in the range from 0 to 4.
    3. the number of years of schooling of all persons in the U. S.
      Answer: A bell-shaped peak around 12 years (most people finish highschool, less people attend college). There may be a secondary peak close to 16 years which represents the people that complete college.
    4. the IQs of all persons in the U.S. Answer: A bell-shaped curve with center at 100 and spread 15. IQ scores are scaled to look like this.

NIST 10 Gram Prototype Weight Weighings Dataset

Descriptive Statistics