Exam Question

Exam time is 1 hour. The exam is worth 100 points in total.

Good Luck!

Questions 1-18: 3.5 points each
Questions 19, 20, 22, 23, 24, 25: 5 points each
Question 21: 7 points

Name, Surname:

1. Which of the following is NOT typically considered a necessary skill or area of knowledge for
a data scientist?
A) Understanding of programming languages like Python and R
B) Expertise in graphic design and multimedia creation
C) Proficiency in math and statistics, including linear algebra and calculus
D) Domain expertise relevant to the data's industry or field

2. In the context of data science, 'Domain Expertise' refers to:
A) Expertise in managing and maintaining domain servers
B) Understanding of specific programming domains like web or mobile
C) In-depth knowledge of a particular industry or field from which data is collected
D) Mastery of domain-specific languages (DSLs)

3. Which of the following best describes first-party data in the context of data analytics?
A) Data collected and aggregated from numerous sources by a third-party organization
B) Data that a company has directly collected from its customers
C) Data purchased from other organizations
D) Data available through public databases

4. In the data cleaning process, which of the following is NOT a key task?
A) Removing major errors, duplicates, and outliers
B) Extracting irrelevant observations
C) Filling in major gaps in the data
D) Increasing the size of the dataset by adding more variables

5. Which of the following statements are true?
A) The standard deviation of a variable represents approximately the average distance from the
mean.
B) The wider the spread of the data, the larger the standard deviation.
C) The standard deviation is the best representation of the spread of variables that have outliers or
extreme values.
D) The upper quartile is the 25th percentile of the ordered data.

6. A researcher is interested in the effects of loud music on learning. A group of 30 students is
selected to participate in a research study. The group of students is...?
A) The population.
B) The standard deviation.
C) The independent variable.
D) The sample.

7. Which of the following is an example of a quantitative variable (also known as a numerical
variable)?
A) the color of an automobile
B) a person’s state of residence
C) a person’s zip code
D) a person’s height, recorded in inches
E) Choices (C) and (D)

8. Which of the following is an example of a categorical variable (also known as a qualitative
variable)?
A) years of schooling completed
B) college major
C) high-school graduate or not
D) annual income (in dollars)
E) Choices (B) and (C)

9. Which of the following data sets has the same standard deviation as the data set with the
numbers 1, 2, 3, 4, 5?
A) Data Set 1: 6, 7, 8, 9, 10
B) Data Set 2: –2, –1, 0, 1, 2
C) Data Set 3: 0.1, 0.2, 0.3, 0.4, 0.5
D) Choices (A) and (B)
E) None of the data sets gives the same standard deviation as the data set 1, 2, 3, 4, 5.
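
For checking an answer to Question 9 after the exam, the comparison can be sketched in Python as follows. This is a minimal sketch that assumes the population standard deviation (`statistics.pstdev`); the names `base`, `candidates`, and `same_spread` are illustrative and not part of the exam.

```python
import statistics

# Reference data set from the question.
base = [1, 2, 3, 4, 5]

# Candidate data sets from choices (A) to (C).
candidates = {
    "Data Set 1": [6, 7, 8, 9, 10],
    "Data Set 2": [-2, -1, 0, 1, 2],
    "Data Set 3": [0.1, 0.2, 0.3, 0.4, 0.5],
}


def same_spread(a, b, tol=1e-9):
    """Return True if both data sets have the same population standard deviation."""
    return abs(statistics.pstdev(a) - statistics.pstdev(b)) < tol


# Map each candidate name to whether its standard deviation matches the reference.
matches = {name: same_spread(base, values) for name, values in candidates.items()}
print(matches)
```

Using the sample standard deviation (`statistics.stdev`) instead would change the values but not which data sets match, since both formulas respond the same way to shifting or rescaling the data.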

10. Which of the following are measures of central tendency?
a) Mean
b) Standard deviation
c) P value
d) Range (minimum to maximum)
e) Variance
f) Median
g) Confidence interval
h) Mode

11. Which of the following data sets has a mean of 15 and standard deviation of 0?
A) 0, 15, 30
B) 15, 15, 15
C) 0, 0, 0
D) There is no data set with a standard deviation of 0.
E) Choices (B) and (C)

12. What are properties of the normal distribution?
A) It’s symmetrical.
B) Mean and median are the same.
C) Most common values are near the mean; less common values are farther from it.
D) Standard deviation marks the distance from the mean to the inflection point.
E) All of the above.

13. The heights (cm) of 6 children were measured as 141, 155, 130, 146, 141, 134.
What is the range of the data?
a) 11
b) 14
c) 17.68
d) 25
e) 34.66
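
The range asked for in Question 13 is the largest value minus the smallest value. A minimal Python sketch of that calculation, using the heights from the question (the variable names are illustrative only):

```python
# Heights (cm) of the 6 children, as given in the question.
heights_cm = [141, 155, 130, 146, 141, 134]

# Range = maximum value minus minimum value.
data_range = max(heights_cm) - min(heights_cm)
print(data_range)
```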

14. Which of the following is not a measure of the spread (variability) in a data set?
A) the range
B) the standard deviation
C) the IQR
D) the variance
E) none of the above

15. Which of the following could only be expressed as a discrete variable? Choose one:
A) a person's height
B) the number of cars at a car show
C) the temperature outside
D) the width of a desk

16. The following box plots represent GPAs of students from two different colleges, call them
College 1 and College 2.
What information is missing on this graph and on the box plots?
A) the total sample size
B) the number of students in each college
C) the mean of each data set
D) Choices (A) and (B)
E) Choices (A), (B), and (C)

17. Find the mean, mode, and median of the given set of data: 5, 8, 12, 17, 12, 14, 6, 8, 12, and 10

A) 11, 12, 10
B) 10, 12, 13
C) 11, 12, 13
D) 10, 12, 11
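
The three statistics in Question 17 can be checked with Python's standard `statistics` module. This is a minimal sketch with illustrative variable names; the computed mean is not necessarily a whole number, so it may need rounding before it is compared with the listed choices.

```python
import statistics

# Data set from the question.
data = [5, 8, 12, 17, 12, 14, 6, 8, 12, 10]

mean_value = statistics.mean(data)      # sum of the values divided by their count
mode_value = statistics.mode(data)      # most frequently occurring value
median_value = statistics.median(data)  # middle of the sorted values (average of the two middle values for an even count)

print(mean_value, mode_value, median_value)
```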

18. The mean of a sample is:
A) always equal to the mean of the population
B) always smaller than the mean of the population
C) computed by summing the data values and dividing the sum by (n - 1)
D) computed by summing all the data values and dividing the sum by the number of items
E) None of the above answers is correct.

19. Write the steps of the data analysis process.

20. Explain the Monitor and Validate process in data science (2 sentences are enough).

21. The table below displays a selection of variables from a study dataset.
Which of the variable(s) in the table are classified as quantitative variable(s)?

22. What is the statistical name for the 50th percentile?

23. Calculate the median from the following data:

24. What is statistics?

25. Seven students got the following exam scores (percent correct) on a science exam: 0%, 40%,
50%, 65%, 75%, 90%, 100%. Which of these exam scores is at the 50th percentile?
