To Announcements

IT 403 -- Takehome Midterm Exam

Posted: October 5, 2016

Part A. Multiple Choice Questions

For each question, you may give a reason or show work for partial credit. You are required to give a reason or show your work if the question is marked with an asterisk (*). Questions 2, 3, 4, 5, 7, 8, and 11 require you to show a reason or work. 5 points each.

  1. What is the SPSS name for a categorical variable?
         a. Nominal    b. Ordinal    c. Quantitative    d. Scale
  2. *Which of the following could NOT easily be studied using a controlled experiment?
         a. Does a dietary supplement for dogs produce better teeth?
         b. Does a new diet pill cause persons to lose weight and keep it off?
         c. Does a new vaccine prevent persons from contracting AIDS?
         d. Does choice of political party influenced by religion?
  3. *Using the Tukey's Hinges method, the IQR for the list 32 57 59 79 94 104 186 is
         a. 15    b. 41    c. 47    d. 154
  4. *The SD+ for the list 7, 11 is
         a. 2.000    b. 2.828    c. 7.414    d. 8.000
  5. *If SD+ for a list is 46.32 when n = 17, what is SD for this list?
         a. 41.87    b. 44.94    c. 45.01    d. 47.01
  6. The distance between the inflection points of a normal histogram is
         a. σ    b. 2σ    c. μ-σ    d. μ+σ
  7. *Heights for human males are normally distributed with mean 5’8" (68") and SD = 3". How many persons out of one million (106) have a height over 6'8" (80")?
      a. Less than 1    b. 3    c. 32    d. 133
  8. *What is the 99.9th percentile for the male human heights in Problem 7?
      a. 6'3"    b. 6'5"    c. 6'7"    d. 6'9"
  9. What does the following normal plot indicate?
      a. Skewed to right b. Skewed to Left c. Thick Tails d. Thin Tails

  10. The scatterplot of a bivariate normal dataset looks like:
         a. an ellipse-shaped point cloud
         b. a circle-shaped point cloud
         c. a rectangle-shaped point cloud
         d. a bell-shaped curve
  11. *Compute correlation by hand of the x and y-variables:
         x: 1 2 3 4 5
         y: 1 3 4 2 5
    If you use SPSS, show a printout of the computed variables.
         a. 0.0    b. 0.4    c. 0.56    d. 0.7
  12. The proportion of variation of the dependent variable that can be explained by the variation of the independent variable is
         a. r     b. r2     c. r SDx SDy     d. r / (SDx SDy)

Part B: A Histogram

Bin1 Height2
[10, 20)   1.0
[20, 40)   1.5
[40, 50)   4.0
[50, 90]   0.5

1 Bin = Histogram interval
2 Units of Height: Percent per horizontal unit.

Using the preceding summary table, work the problems below. Remember that the area of each histogram rectangle is proportional to the percentage of observations in the bin that forms the base of each rectangle.

  1. (10 pts.) Draw the histogram. If you draw the histogram with typewriter graphics like the answers to the histogram problems in the lecture notes, use a monospaced font like Courier New or Lucida Console. Alternatively, you can draw the histogram with pen on paper and scan it in for your submission.
  2. (10 pts.) Compute Q1, Q2, and Q3 for the histogram.
  3. (5 pts.) Compute the mean of the histogram using a weighted average.
  4. (5 pts.) Estimate what percentage of observations are found in the bin (30, 50].
  5. (5 pts.) The list shown below represents the smallest and largest numbers that are summarized in the summary table for this problem. (The numbers that are not shown in this list are clearly not outlier candidates.) Draw the boxplot based on this list and on the quartiles computed in Part 2. Show your calculations for the inner and outer fences that you use to classify the outliers
         10.11  10.16  10.23  10.48  10.98 ...
         ... 74.01  73.55  76.98  88.34  89.98

    Note: computing the boxplot from the raw dataset would be more accurate than estimating the quartiles from the histogram, but this is the best that you can do if you don't have access to the entire raw dataset.

Part C: Essay Question

Part D: SPSS Analyses

Part E: Regression Problem