BME690 exam Questions
When comparing the difference between two population proportions, a pooled estimate of the population proportion can be used for two-tail tests where the null hypothesis assumes that the population proportions are equal. What is the alternate hypothesis? A. H1: p1 > p2 B. H1: p1 = p2 C. H1: p1 < p2 D. H1: p1 not equal to p2
D. H1: p1 not equal to p2
An independent t-test can be used to assess which of the following? A. It assesses differences between scores obtained on two separate occasions from the same participants B. It assesses relationships between two interval data sets C. It assesses how many factors there are in questionnaire data D. It assesses differences between two groups of participants
D. It assesses differences between two groups of participants
In a linear regression equation, Y = a + bX, what does b denote? A. The score on the variable X. B. The correlation coefficient, the strength of the line C. The intercept with the Y-axis. D. The regression coefficient, the slope of the line.
D. The regression coefficient, the slope of the line.
Which one of these variables is a continuous random variable? A. The number of tattoos a randomly selected person has. B. The number of women taller than 68 inches in a random sample of 5 women. C. The number of correct guesses on a multiple-choice test. D. The time it takes a randomly selected student to complete an exam.
D. The time it takes a randomly selected student to complete an exam.
Five hundred (500) random samples of size n=900 are taken from a large population in which 10% are left-handed. The proportion of the sample that is left-handed is found for each sample and a histogram of these 500 proportions is drawn. Which interval covers the range into which about 68% of the values in the histogram will fall? A. .1 ± .020 B. .1 ± .0134 C. .1 ± .0167 D. .1 ± .010
D. .1 ± .010
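The interval follows from the standard error of a sample proportion, sqrt(p(1 - p)/n), combined with the empirical rule that about 68% of values lie within one standard error of the mean. A minimal sketch, with variable names chosen here for illustration:

```python
import math

p = 0.10   # population proportion of left-handers (from the question)
n = 900    # size of each random sample (from the question)

# Standard error of the sample proportion: sqrt(p * (1 - p) / n)
se = math.sqrt(p * (1 - p) / n)

# Empirical rule: roughly 68% of sample proportions fall within one SE of p
low, high = p - se, p + se
print(f"SE = {se:.3f}; about 68% of proportions fall in ({low:.2f}, {high:.2f})")
```

The number of samples drawn (500) affects how smooth the histogram looks, not the width of the interval.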
Heights of college women have a distribution that can be approximated by a normal curve with a mean of 65 inches and a standard deviation equal to 3 inches. About what proportion of college women are between 65 and 67 inches tall? A. 0.50 B. 0.17 C. 0.75 D. 0.25
D. 0.25
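The proportion can be checked by taking the difference of two normal cumulative probabilities. This sketch uses `NormalDist` from Python's standard library; the variable names are illustrative only:

```python
from statistics import NormalDist

# Heights modelled as normal with mean 65 in and standard deviation 3 in
heights = NormalDist(mu=65, sigma=3)

# P(65 < X < 67) = F(67) - F(65); F(65) is 0.5 because 65 is the mean
proportion = heights.cdf(67) - heights.cdf(65)
print(round(proportion, 2))
```

The z-score of 67 is (67 - 65)/3, about 0.67, so the result is close to one quarter.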
The grade average in a class of 15 is 86%. If one additional student earns 100% in the class, what is the new class average? A. 87.4% B. None C. 93% D. 86.875%
D. 86.875%
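The new average is the old total plus the new score, divided by the new class size. A short sketch of that arithmetic (names are illustrative):

```python
n_old = 15         # original class size
mean_old = 86.0    # original class average, in percent
new_score = 100.0  # score of the additional student

# Recover the old total, add the new score, divide by the new class size
new_mean = (n_old * mean_old + new_score) / (n_old + 1)
print(new_mean)
```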
Which rule about nucleotide pairing is true? A. A is always paired with G; T is always paired with C B. G is always paired with T; A is always paired with C. C. there is no rule: any combination is possible D. A is always paired with T; G is always paired with C
D. A is always paired with T; G is always paired with C
A sampling distribution is the probability distribution for which of the following: A. A population parameter B. A population C. A sample D. A sample statistic
D. A sample statistic
In which of the following ways can qualitative research be applied? A. Where numbers are not required B. In advance of quantitative study, to help the researchers decide what questions to ask and how to ask them C. Alongside or after quantitative research to help explain the data, or perhaps to 'put flesh and blood' on the bones of the figures and numbers from the quantitative survey D. all of the above
D. All of the above
Which statement is not true about confidence intervals? A. A confidence interval is an interval of values computed from sample data that is likely to include the true population value. B. A confidence interval between 20% and 40% means that the population proportion lies between 20% and 40%. C. A 99% confidence interval procedure has a higher probability of producing intervals that will include the population parameter than a 95% confidence interval procedure. D. An approximate formula for a 95% confidence interval is sample estimate + margin of error.
D. An approximate formula for a 95% confidence interval is sample estimate + margin of error.
In the base graphics system, which function is used to add elements to a plot? A. Boxplot() B. Text() C. Treat() D. Both A and B
D. Both A and B
_________ refers to the extent to which a particular characteristic or trait in the population is attributable to genetic differences. A. Genetic drift B. Genetic diversity C. Genetic variance D. Heritability
D. Heritability
If the function in a console is.matrix(X) returns true then X can be considered as a ________ A. Matrix Vector B. Matrix object C. Vector D. Matrix data object
D. Matrix data object
The statement "If there is sufficient evidence to reject a null hypothesis at the 10% significance level, then there is sufficient evidence to reject it at the 5% significance level" is: Please select the best answer of those provided below. A. Always True B. Not Enough Information; this would depend on the type of statistical test used C. Never True D. Sometimes True; the p-value for the statistical test needs to be provided for a conclusion
D. Sometimes True; the p-value for the statistical test needs to be provided for a conclusion
A researcher is interested in the travel time of Utrecht University students to college. A group of 50 students is interviewed. Their mean travel time is 16.7 minutes. For this study the mean of 16.7 minutes is an example of a(n) A. Population B. Parameter C. Sample D. Statistic
D. Statistic
The 'standard deviation' is A. a measure of central tendency. B. a specific type of distribution of scores C. a specific type of table. D. a measure of dispersion.
D. a measure of dispersion
The histogram is broadly based on A. a set of dots joined up with straight lines B. A set of columns that lie on a vertical axis C. none of the options D. a set of columns that lie on a horizontal axis.
D. a set of columns that lie on a horizontal axis.
A single element of a character vector is referred as ________ A. Raw data B. data strings C. string D. character string
D. character string
Which of the following finds the maximum value in the vector x, excluding missing values? A. x%in%y B. all(x) C. rm(x) D. max(x, na.rm=TRUE)
D. max(x, na.rm=TRUE)
_____________ function can be used to select the random sample of size 'n' from a huge A. Sample() B. simple() C. While() D. Signal()
A. Sample()
Which of the following are properties of the normal distribution? A. All of the options B. The mean, median, and mode are equal. C. The normal curve is bell-shaped and symmetric about the mean. D. The total area under the curve is equal to one.
A. All of the options
Statistical techniques that summarize and organize the data are classified as: A. Descriptive statistics B. Sample statistics C. Inferential statistics D. Population statistics
A. Descriptive statistics
The p-value in hypothesis testing represents which of the following: Please select the best answer of those provided below. A. The probability of observing results as extreme or more extreme than currently observed, given that the null hypothesis is true B. The probability that the observed results are statistically significant, given that the null hypothesis is true C. The probability of failing to reject the null hypothesis, given the observed results D. The probability that the null hypothesis is true, given the observed results
A. The probability of observing results as extreme or more extreme than currently observed, given that the null hypothesis is true
Failing to reject the null hypothesis when it is false is: A. Type II error B. alpha C. Type I error D. beta
A. Type II error
A dataframe named frame contains two numerical columns named A and B. Which of the following commands will draw a scatter plot between the two columns of the dataframe? A. plot(frame$A,frame$B) B. with(frame,plot(A,B)) C. All of the options D. ggplot(data = frame, aes(A,B))+geom_point()
A. plot(frame$A,frame$B)
In a random sample of 50 men, 40% said they preferred to walk up stairs rather than take the elevator. In a random sample of 40 women, 50% said they preferred the stairs. The difference between the two sample proportions (men - women) is to be calculated. Which of the following choices correctly denotes the difference between the two sample proportions that is desired?
A. p̂1 − p̂2 = 0.10 B. p1 − p2 = −0.10 C. p1 − p2 = 0.10 D. p̂1 − p̂2 = −0.10
What is the difference between data measured on an interval scale and data measured on a ratio scale? A. A ratio scale has a true zero point, so zero on the scale corresponds to zero of the concept being measured. B. A ratio scale puts scores into categories, while an interval scale measures on a continuous scale. C. An interval scale has a true zero point, so zero on the scale corresponds to zero of the concept being measured. D. A ratio scale has equal intervals between the points on the scale, whereas an interval scale does not.
A. A ratio scale has a true zero point, so zero on the scale corresponds to zero of the concept being measured.
The relationship between number of beers consumed (x) and blood alcohol content (y) was studied in 16 male college students by using least squares regression. The following regression equation was obtained from this study: y = -0.0127 + 0.0180x. The above equation implies that: A. each beer consumed increases blood alcohol by an average amount of 1.8% B. each beer consumed increases blood alcohol by exactly 0.018 C. each beer consumed increases blood alcohol by 1.27% D. on average it takes 1.8 beers to increase blood alcohol content by 1%
A. each beer consumed increases blood alcohol by an average amount of 1.8%
The observation which occurs most frequently in a sample is the A. mode B. Median C. mean deviation D. standard deviation
A. mode
Which of the following will start the R program? A. $R B. @R C. *R D. >R
A. $R
A statistician wishes to determine the difference between two population means. A sample of 10 items from Population #1 yields a mean of 185 with a standard deviation of 20. The sample of 12 items from Population #2 yields a mean of 200 with a standard deviation of 25. Assume that the values are normally distributed in each population. How many degrees of freedom are there for this test? A. 11 B. 22 C. 21 D. 20
D. 20
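Two conventions are common for the degrees of freedom of a two-sample t-test: the pooled (equal-variance) formula n1 + n2 - 2, and the Welch-Satterthwaite approximation for unequal variances. The sketch below evaluates both for the figures in the question; the variable names are illustrative and the question itself does not say which convention is intended:

```python
n1, s1 = 10, 20.0   # sample size and standard deviation, Population #1
n2, s2 = 12, 25.0   # sample size and standard deviation, Population #2

# Pooled (equal-variance) t-test
df_pooled = n1 + n2 - 2

# Welch-Satterthwaite approximation (unequal variances)
v1, v2 = s1**2 / n1, s2**2 / n2
df_welch = (v1 + v2) ** 2 / (v1**2 / (n1 - 1) + v2**2 / (n2 - 1))

print(df_pooled, round(df_welch, 2))
```

For these data the two conventions land at essentially the same value, so the choice between them does not change the answer.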
The range of the data 14, 6, 12, 17, 21, 10, 4, 3 is A. 18 B. 12 C. 10 D. 16
A. 18
In Humans, each cell normally contains __________ of chromosomes. A. 23 pairs B. 46 pairs C. 11 pairs D. 32 pairs
A. 23 Pairs
The mean of the set {27,14,11,23, x} is 20. What is the mean of the set {x,2x,11,8,31}? A. 25 B. 20 C. 14 D. 27
A. 25
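The problem can be worked in two steps: solve for x from the first mean, then substitute into the second set. A minimal sketch (names are illustrative):

```python
known = [27, 14, 11, 23]   # the known members of the first set
target_mean = 20           # mean of the first set, which has 5 members

# The five values must sum to 5 * 20, so x is whatever is left over
x = target_mean * 5 - sum(known)

second = [x, 2 * x, 11, 8, 31]
second_mean = sum(second) / len(second)
print(x, second_mean)
```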
Which of the following would be synthesized using 5'-CAGTTCGGA-3' as a template? A. 3'-GTCAAGCCT-5' B. 3'-CAGTTCGGA-5' C. 3'-TCCGAACTG-5' D. 3'-AGGCTTGAC-4'
A. 3'-GTCAAGCCT-5'
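The new strand is built by pairing each template base with its complement (A with T, G with C). Because the strands are antiparallel, the complement written base-for-base under a 5'→3' template reads 3'→5'. A small sketch, with a helper name chosen here for illustration:

```python
# Watson-Crick base pairing for DNA
DNA_PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}

def complement(strand: str) -> str:
    """Return the base-by-base complement, aligned under the input strand."""
    return "".join(DNA_PAIRS[base] for base in strand)

template = "CAGTTCGGA"            # written 5' -> 3'
new_strand = complement(template)  # therefore reads 3' -> 5'
print(f"3'-{new_strand}-5'")
```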
The probability is p = 0.80 that a patient with a certain disease will be successfully treated with a new medical treatment. Suppose that the treatment is used on 40 patients. What is the "expected value" of the number of patients who are successfully treated? A. 32 B. 8 C. 20 D. 40
A. 32
The DNA sequence ATCAGCGCTGGC is part of a gene. How many amino acids are coded for by this message? A. 4 B. 20 C. 12 D. 8
A. 4
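Each amino acid is specified by a codon of three bases, so the count is the sequence length divided by three. The sketch below assumes the reading frame starts at the first base and that none of the codons is a stop codon; the helper name is hypothetical:

```python
sequence = "ATCAGCGCTGGC"

# Split the sequence into consecutive, non-overlapping triplets (codons)
codons = [sequence[i:i + 3] for i in range(0, len(sequence), 3)]
print(codons, len(codons))

def bases_needed(n_amino_acids: int) -> int:
    """Coding bases required for a peptide, ignoring start and stop codons."""
    return 3 * n_amino_acids
```

The same rule run in reverse gives the number of bases needed for a protein of known length, for example `bases_needed(30)`.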
What must you include when reporting an ANOVA? A. All of the options B. F statistic C. Degrees of freedom D. P-value
A. All of the options
What is the class of b in the following R code? b<-c(TRUE, TRUE, 1) A. Numeric B. integer C. Logical D. Character
A. Numeric
Identify the scale of measurement for the following: military title -- Lieutenant, Captain, Major. A. Ordinal B. Interval C. Nominal D. Ratio
A. Ordinal
Which sequence of DNA bases would pair with this partial strand ATG TGA CAG A. TAC ACT GTC B. CAT TCA CTG C. ATG TGA CAG D. GTA AGT GAC
A. TAC ACT GTC
A sociologist focusing on popular culture and media believes that the average number of hours per week (hrs/week) spent using social media is greater for women than for men. Examining two independent simple random samples of 100 individuals each, the researcher calculates sample standard deviations of 2.3 hrs/week and 2.5 hrs/week for women and men respectively. If the average number of hrs/week spent using social media for the sample of women is 1 hour greater than that for the sample of men, what conclusion can be made from a hypothesis test where: 𝐻0: 𝜇𝑊 − 𝜇𝑀 = 0 𝐻1: 𝜇𝑊 − 𝜇𝑀 > 0 A. The observed difference in average number of hrs/week spent using social media is significant B. A conclusion is not possible without knowing the population sizes C. The observed difference in average number of hrs/week spent using social media is not significant D. A conclusion is not possible without knowing the average number of hrs/week spent using social media in each sample
A. The observed difference in average number of hrs/week spent using social media is significant
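With 100 observations per group, the test statistic can be approximated by a z-statistic: the observed difference divided by its standard error. The sketch below assumes a one-sided test at the conventional 5% level, which the question does not state explicitly; names are illustrative:

```python
import math
from statistics import NormalDist

n_w = n_m = 100        # sample sizes for women and men
s_w, s_m = 2.3, 2.5    # sample standard deviations, hrs/week
diff = 1.0             # observed difference in sample means (women - men)

# Standard error of the difference between two independent sample means
se = math.sqrt(s_w**2 / n_w + s_m**2 / n_m)
z = diff / se

# One-sided p-value for H1: mu_W - mu_M > 0 (large-sample normal approximation)
p_value = 1 - NormalDist().cdf(z)
print(round(z, 2), round(p_value, 4))
```

The statistic comes out near 2.9, well beyond the one-sided 5% critical value of about 1.645. Only the difference in sample means is needed, not the individual means.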
The correlation coefficient is used to determine: A. The strength of the relationship between the x and y variables B. none of the above C. A specific value of the y-variable given a specific value of the x-variable D. A specific value of the x-variable given a specific value of the y-variable
A. The strength of the relationship between the x and y variables
In a molecule of double-stranded DNA, the amount of Adenine present is always equal to the amount of A. Thymine B. uracil C. guanine D. cytosine
A. Thymine
DNA sequence is ACAGTGC. How would this be coded on mRNA? A. UGUCACG B. CACUGUA C. TGTCACG D. GUGACAU
A. UGUCACG
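Transcription pairs each DNA template base with its RNA complement, using uracil (U) in place of thymine. A minimal sketch, assuming the given sequence is the template strand; the function name is illustrative:

```python
# DNA template base -> complementary mRNA base (U replaces T in RNA)
DNA_TO_MRNA = {"A": "U", "T": "A", "G": "C", "C": "G"}

def transcribe(template: str) -> str:
    """Return the mRNA complementary to a DNA template strand."""
    return "".join(DNA_TO_MRNA[base] for base in template)

mrna = transcribe("ACAGTGC")
print(mrna)
```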
Which of the following units are repeatedly joined together to form a strand of DNA? A. Nucleotides B. Polysaccharides C. fatty acids D. amino acids
A. nucleotides
According to the central limit theorem, the sampling distribution of the sample mean can be approximated by the normal distribution as the A. sample size gets "large enough" B. sample standard deviation decreases C. number of samples gets "large enough" D. population standard deviation increases
A. sample size gets "large enough"
If there is a very strong correlation between two variables then the correlation coefficient must be A. much larger than 0, regardless of whether the correlation is negative or positive B. much smaller than 0, if the correlation is negative C. None of these alternatives is correct. D. any value larger than 1
B. much smaller than 0, if the correlation is negative
Which of the following statistical concepts is used to test differences in the means for more than two independent populations? A. Regression analysis B. Analysis of variance C. Multiple t test D. Confidence interval
B. Analysis of variance
The standard deviation of the sampling distribution of the sample mean is also called the A. population standard deviation B. standard error of the mean C. finite population correction factor D. central limit theorem
B. standard error of the mean
When the correlation coefficient, r, is close to one: A. it is impossible to tell if there is a relationship between the two variables B. there is a strong linear relationship between the two variables C. the slope of the regression line will be close to one D. there is no relationship between the two variables
B. there is a strong linear relationship between the two variables
If a command is not complete at the end of a line, R will give a different prompt, by default it is ____________. A. * B. + C. / D. -
B. +
The mode of the data 21, 26, 22, 29, 23, 29, 26, 29, 22, 23 is A. 22 B. 29 C. 26 D. 21
B. 29
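The mode can be found by tallying how often each value occurs. A short sketch using `collections.Counter` from the standard library:

```python
from collections import Counter

data = [21, 26, 22, 29, 23, 29, 26, 29, 22, 23]

# Tally each value, then take the one with the highest count
counts = Counter(data)
mode_value, frequency = counts.most_common(1)[0]
print(mode_value, frequency)
```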
A protein contains 30 amino acids. How many N-bases code for this protein? A. 60 B. 90 C. 120 D. 30
B. 90
What does ANOVA stand for? A. Analysis of values and averages. B. Analysis of variance C. Analysis of non ordinal values D. Analysis of values and averages.
B. Analysis of variance
What do we call the most basic structure of living things? A. Life B. Cell C. DNA D. Skin
B. Cell
What will be the output of the following R code? x<- c(3, 7, NA, 4, 7) y<- c(5, NA, 1, 2, 2) x+y A. 15.5 B. Missing Data C. 5 D. Symbol
B. Missing Data
The order in which participants complete a task is an example of what level of measurement? A. Interval B. Ordinal C. ratio D. Nominal
B. Ordinal
Under what conditions does a Type I error occur? A. None of the options B. The null hypothesis is rejected even though it is true C. The null hypothesis is accepted even though it is false D. Both the null hypothesis and the alternative hypothesis are rejected
B. The null hypothesis is rejected even though it is true
Which nucleotide bases could be found in a molecule of RNA? A. adenine, guanine, cytosine, thymine B. adenine, guanine, cytosine, uracil C. sugar, phosphate, and base D. adenine, guanine, thymine
B. adenine, guanine, cytosine, uracil
In eukaryotic cells DNA has the appearance of a ___________. A. circle B. double helix C. single strand D. triple helix
B. double helix
A dataframe named frame contains two numerical columns named A and B. Which of the following commands will draw a scatter plot between the two columns of the dataframe? A. with(frame,plot(A,B)) B. plot(frame$A,frame$B) C. ggplot(data=frame,aes(A,B))+geom_point() D. All of the above
B. plot(frame$A,frame$B)
The following are scores from a class test. Calculate the sample mean, standard deviation and variance. 29, 26, 13, 23, 23, 25, 17, 22, 17, 19, 12, 26, 30, 30, 18, 14, 12, 26, 17, 18 A. 20.85, 5.94, 35.29 B. 20.50, 5.79, 30.29 C. 20.85, 5.79, 35.29 D. 20.50, 5.94, 30.29
A. 20.85, 5.94, 35.29
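The figures can be checked with Python's `statistics` module. The sample formulas divide the sum of squared deviations by n - 1, whereas the population formulas divide by n, which is where a standard deviation near 5.79 comes from. A minimal sketch:

```python
from statistics import mean, pstdev, stdev, variance

scores = [29, 26, 13, 23, 23, 25, 17, 22, 17, 19,
          12, 26, 30, 30, 18, 14, 12, 26, 17, 18]

sample_mean = mean(scores)
sample_var = variance(scores)   # divides by n - 1
sample_sd = stdev(scores)       # square root of the sample variance
population_sd = pstdev(scores)  # divides by n, shown for comparison

print(round(sample_mean, 2), round(sample_sd, 2), round(sample_var, 2))
print(round(population_sd, 2))
```

The sample standard deviation must equal the square root of the sample variance, so a reported pair should always satisfy that relationship.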
What is the major attribute of Correlation Analysis? A. Variations among variables B. Regression among variables C. Association among variables D. Difference among variables
C. Association among variables
Human genome contains about A. 5 billion base pairs B. 4 billion base pairs C. 3 billion base pairs D. 2 billion base pairs
C. 3 billion base pairs
What would be the output of the following code? >x<- 1:4 >y<-6:9 >z<- x+y >z A. Null B. 7 9 11 13 14 C. 7 9 11 13 D. 9 11 13
C. 7 9 11 13
What assumption(s) must be met to conduct an independent-sample t-test? A. The data from the dependent variable must be normally distributed B. There must be random sampling of cases C. All of the options D. The data from the dependent variable must be interval or ratio
C. All of the options
Assume that the difference between the observed, paired sample values is defined in the same manner and that the specified significance level is the same for both hypothesis tests. Using the same data, the statement that "a paired/dependent two sample t-test is equivalent to a one sample t-test on the paired differences, resulting in the same test statistic, same p-value, and same conclusion" is: Please select the best answer of those provided below. A. Sometimes True B. Never True C. Always True D. Not enough information
C. Always true
Characterizing molecular component is A. Proteomics B. Cheminformatics C. Bioinformatics D. Genomics
C. Bioinformatics
What will be the output of the following R code? a<-c("a","b") mode(a) A. Complex B. Numeric C. Character D. Integer
C. Character
What does DNA stand for? A. Dehydrogenated acid B. Decreased Nature Alliance C. Deoxyribonucleic Acid D. Divine Nature Alliance
C. Deoxyribonucleic Acid
Elementary commands in R consist of either ________ or assignments. A. packages B. language C. expressions
C. Expressions
How will you check if an element is present in a vector? A. Dismatch() B. Search() C. Match() D. Mismatch()
C. Match()
R is a/an ________ programming language? A. GPL B. Definite source C. Open source D. Closed source
C. Open source
R comes with a ___________ to help you optimize your code and improve its performance. A. Debugger B. Monitor C. Profiler D. None of the above
C. Profiler
The standard error is a statistical measure of: A. The normal distribution of scores around the sample mean B. The degree to which a sample has been accurately stratified C. The extent to which a sample mean is likely to differ from the population mean D. The clustering of scores at each end of a survey scale
C. The extent to which a sample mean is likely to differ from the population mean
Which of the following sorts a data frame by the order of the elements in B? A. x[order(x$B),] B. x[ordersort(x$B),] C. x[rev(order(x$B)),]
C. x[rev(order(x$B)),]
Data frames can be converted to a matrix by calling data. ________ A. as.matr() B. as.matr() C. as.matrix() D. None of the above
C. as.matrix()
DNA does all but which of the following? A. provides the instructions for the synthesis of messenger RNA B. remains constant despite changes in environmental conditions C. Is read by ribosomes during the process of translation D. serves as the genetic material passed from parent to offspring
C. is read by ribosomes during the process of translation
Covariances between assets: A. can only fluctuate between -1 and +1 B. play an important role in determining a portfolio's expected return. C. measure the degree of dependency between two assets. D. can only be positive
C. measure the degree of dependency between two assets
What is the function to set row names for data frame? A. col.names() B. colnames() C. row.names() D. column name cannot be set for a data frame
C. row.names()
How is random sampling helpful? A. All of the options B. Free from personal biases C. An economical method of data collection D. Reasonably accurate
A. All of the options
All of the following are directly involved in translation except... A. DNA B. mRNA C. Ribosomes D. tRNA
A. DNA
According to the central dogma, which of the following represents the flow of genetic information in cells? A. DNA to RNA to protein B. DNA to protein to RNA C. RNA to DNA to protein D. protein to DNA to RNA
A. DNA to RNA to protein
Sex-linked genes are genes on A. All 23 pairs of chromosomes B. the Y chromosome only C. the Non-homologous parts of X and Y chromosomes D. the X chromosome only.
C. the Non-homologous parts of X and Y chromosomes
The median of observations 11, 12, 14, 18, x + 2, 20, 22, 25, 61 arranged in ascending order is 21. Find the value of x. A. 20 B. 25 C. 22 D. 19
D. 19
The covariance is: A. A measure of the strength of relationship between two variables. B. An unstandardized version of the correlation coefficient. C. Dependent on the units of measurement of the variables. D. All of the options.
D. All of the options.
What level of measurement would be used if participants were asked to choose their favorite picture from a set of six? A. Ordinal B. Ratio C. Interval D. Nominal
D. Nominal