Question 1

Rain is developing a psychological scale measuring social anxiety using 40 Likert-scale items. To assess how well these items measure the same construct, which reliability method should Rain use?

Accepted Answer

Cronbach’s Coefficient Alpha

Answer

Split-half reliability

Answer

KR-20

Answer

Inter-rater reliability
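
Worked example: a minimal sketch of how coefficient alpha could be computed for Likert-type items. The function name `cronbach_alpha` and the small response matrix are made up for illustration. It assumes the usual formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores), with sample variances.

```python
from statistics import variance

def cronbach_alpha(responses):
    """responses: one row of item scores per respondent."""
    k = len(responses[0])                       # number of items
    items = list(zip(*responses))               # transpose to per-item columns
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - item_var_sum / total_var)

# Hypothetical data: 5 respondents, 4 Likert items scored 1-5
data = [
    [4, 5, 4, 5],
    [2, 2, 3, 2],
    [3, 3, 3, 4],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
]
alpha = cronbach_alpha(data)
print(round(alpha, 3))
```

Values near 1 suggest the items vary together, which is what is expected when they measure the same construct.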

Question 2

Ryan is reviewing the reliability of a test where all items are multiple-choice questions scored as correct or incorrect. However, the items vary in difficulty. Which index should he use to evaluate internal consistency?

Accepted Answer

KR-20

Answer

KR-21

Answer

Rulon’s Formula

Answer

Cronbach’s Alpha
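
Worked example: a rough sketch of KR-20 for right/wrong items. The function name `kr20` and the score matrix are hypothetical. It assumes the common form KR-20 = k/(k-1) × (1 − Σpq / variance of total scores), with population variance so that it matches the p × q item variances.

```python
from statistics import pvariance

def kr20(responses):
    """responses: rows of 0/1 item scores, one row per examinee."""
    n = len(responses)
    k = len(responses[0])
    pq_sum = 0.0
    for col in zip(*responses):
        p = sum(col) / n            # proportion answering the item correctly
        pq_sum += p * (1 - p)       # p*q is the variance of a 0/1 item
    total_var = pvariance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - pq_sum / total_var)

# Hypothetical data: 5 examinees, 4 items of different difficulty
scores = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(round(kr20(scores), 2))
```

Because each item contributes its own p × q term, the formula does not require the items to be equally difficult.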

Question 3

Chuchay is constructing a speed test where each item has equal difficulty. Which internal consistency formula would best apply to her test?

Accepted Answer

KR-21

Answer

Spearman-Brown Prophecy

Answer

KR-20

Answer

Cronbach’s Alpha

Question 4

Eudoxia correlates scores from the odd-numbered and even-numbered items of her test to estimate internal consistency. Which method is she using?

Accepted Answer

Split-half reliability

Answer

Inter-scorer reliability

Answer

Cronbach’s Alpha

Answer

Test-retest reliability

Question 5

Chow used the Spearman-Brown Prophecy formula and got a reliability estimate of 0.65. To improve reliability, what should she do?

Accepted Answer

Increase the number of items

Answer

Remove all items with high discrimination

Answer

Administer the test twice

Answer

Switch to non-dichotomous items
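
Worked example: a short sketch of the Spearman-Brown prophecy formula, r_new = n·r / (1 + (n − 1)·r), where n is the factor by which the test is lengthened. The function name is illustrative, and the formula assumes the added items are comparable in quality to the existing ones.

```python
def spearman_brown(r, n):
    """Predicted reliability when test length is multiplied by n."""
    return (n * r) / (1 + (n - 1) * r)

current = 0.65
for factor in (1, 1.5, 2, 3):
    # Each line shows the lengthening factor and the projected reliability
    print(factor, round(spearman_brown(current, factor), 3))
```

Under these assumptions, doubling the test raises the projected reliability from 0.65 to roughly 0.79.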

Question 6

Ayan and Eve both rated a series of patient interviews using a categorical behavior scale. To measure the agreement between them, which statistic should be used?

Accepted Answer

Cohen’s Kappa

Answer

Fleiss’ Kappa

Answer

Spearman-Brown

Answer

Cronbach’s Alpha
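
Worked example: a minimal sketch of Cohen's kappa for two raters, kappa = (observed agreement − chance agreement) / (1 − chance agreement). The function name and the ratings are invented for illustration.

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    n = len(rater_a)
    # Proportion of cases where both raters chose the same category
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Agreement expected by chance from each rater's category proportions
    expected = sum(
        (counts_a[c] / n) * (counts_b[c] / n)
        for c in set(rater_a) | set(rater_b)
    )
    return (observed - expected) / (1 - expected)

# Hypothetical categorical ratings of eight interviews
ayan = ["calm", "calm", "agitated", "agitated", "calm", "withdrawn", "calm", "agitated"]
eve = ["calm", "calm", "agitated", "calm", "calm", "withdrawn", "agitated", "agitated"]
print(round(cohens_kappa(ayan, eve), 3))
```

The raters agree on 6 of 8 interviews, but kappa comes out lower than 0.75 because some of that agreement would be expected by chance.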

Question 7

Yuki coordinated a research study in which five different raters categorized behavioral responses. Which reliability coefficient should be used to evaluate the agreement among them?

Accepted Answer

Fleiss’ Kappa

Answer

KR-20

Answer

Cohen’s Kappa

Answer

Spearman-Brown Formula

Question 8

Nieve observed that some items on her test measured empathy while others assessed assertiveness. What does this indicate about the test’s internal consistency?

Accepted Answer

Low internal consistency due to heterogeneity

Answer

High homogeneity

Answer

Good construct validity

Answer

High reliability

Question 9

Miffy administered a test and then calculated the Average Proportional Distance (APD) to determine how similar item responses were. What is APD primarily used to assess?

Accepted Answer

Internal consistency

Answer

Inter-scorer reliability

Answer

Response style bias

Answer

Test-retest stability

Question 10

Ayan designed a test but accidentally split the items based on difficulty (e.g., easy in one half, difficult in the other) when computing split-half reliability. What error might this cause?

Accepted Answer

It may distort reliability due to biased item sampling.

Answer

It will lead to test fatigue.

Answer

It will reduce face validity.

Answer

It will artificially inflate internal consistency.

Question 11

Pablo is tasked with measuring the consistency of a personality test administered twice to the same group two weeks apart. He finds a high correlation between both sets of scores. What type of reliability is this?

Accepted Answer

Test-Retest Reliability

Answer

Inter-Scorer Reliability

Answer

Internal Consistency

Answer

Alternate Forms Reliability

Question 12

Stell noticed that the correlation between the test scores from Time 1 and Time 2 was inflated due to the test being administered just a day apart. What phenomenon most likely occurred?

Accepted Answer

Carryover Effect

Answer

Practice Effect

Answer

Test Wiseness

Answer

Test Sophistication

Question 13

Maloi designed two parallel forms of a musical aptitude test. She ensured that both have the same number of items, difficulty levels, and content coverage. Which error source is she minimizing?

Accepted Answer

Content Sampling

Answer

Scoring Variance

Answer

Practice Effect

Answer

Time Sampling

Question 14

Jhoanna is analyzing test scores using Classical Test Theory. She wants to compute the proportion of score variance due to true ability. What is this index called?

Accepted Answer

Reliability Coefficient

Answer

Validity Coefficient

Answer

Standard Error of Measurement

Answer

Item Discrimination Index

Question 15

Mikha observed that the observed score of a trainee fluctuated due to fatigue and weather conditions. This inconsistency reflects what kind of error?

Accepted Answer

Random Error

Answer

Systematic Error

Answer

Validity Error

Answer

Measurement Bias

Question 16

Denise developed a test to measure creativity using yes-no questions of varying difficulty. Which reliability estimate should she use to assess internal consistency?

Accepted Answer

KR-20

Answer

Cronbach’s Alpha

Answer

KR-21

Answer

Spearman-Brown

Question 17

Colet split her test items into odd and even numbered sets to check consistency. She then applied the Spearman-Brown Formula. What type of reliability is she estimating?

Accepted Answer

Split-Half Reliability

Answer

Inter-rater Reliability

Answer

Test-Retest Reliability

Answer

Alternate Form Reliability
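
Worked example: a sketch of the odd-even procedure described in this question. The half-test scores are correlated, then the Spearman-Brown correction 2r / (1 + r) estimates reliability for the full-length test. Function names and data are hypothetical.

```python
from math import sqrt

def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def split_half_reliability(responses):
    odd = [sum(row[0::2]) for row in responses]    # items 1, 3, 5, ...
    even = [sum(row[1::2]) for row in responses]   # items 2, 4, 6, ...
    r_half = pearson_r(odd, even)
    return 2 * r_half / (1 + r_half)               # Spearman-Brown correction

# Hypothetical data: 5 examinees, 6 items scored 0/1
data = [
    [1, 1, 1, 1, 1, 0],
    [1, 1, 1, 0, 0, 0],
    [1, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 1, 0, 0, 0, 0],
]
print(round(split_half_reliability(data), 3))
```

The correction is needed because each half is only half as long as the full test, and shorter tests tend to be less reliable.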

Question 18

Anne and Elle independently rated applicants' performance in a dance audition. Their scores differed significantly. This inconsistency reflects what error?

Accepted Answer

Inter-Scorer Variance

Answer

Systematic Error

Answer

Time Sampling Error

Answer

Content Sampling Error

Question 19

Angela conducted a test but later learned half the participants didn’t return for the retest due to an overseas tour. What reliability threat is this?

Accepted Answer

Mortality

Answer

Carryover

Answer

Practice Effect

Answer

Test Wiseness

Question 20

Justin wants to use the most universally applicable form of reliability for two equivalent test versions. What method is best suited for his study?

Accepted Answer

Alternate Forms Reliability

Answer

Inter-Scorer Reliability

Answer

Split-Half

Answer

Test-Retest

Question 21

Jisoo received a raw score of 85 on an aptitude test. The manual reports a standard error of measurement (SEM) of 3. What does this imply at the 95% confidence level?

Accepted Answer

Her true score likely falls between 79 and 91.

Answer

Her true score is exactly 85.

Answer

Her true score is greater than 90.

Answer

Her observed score is less reliable than her predicted score.
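
Worked example: a small sketch of the confidence band around an observed score, observed ± z × SEM. The function name is illustrative; z = 1.96 is the usual multiplier for about 95%, and many textbooks round it to 2.

```python
def true_score_interval(observed, sem, z=1.96):
    """Confidence band around an observed score."""
    margin = z * sem
    return observed - margin, observed + margin

low, high = true_score_interval(85, 3)
print(round(low), round(high))
```

With a score of 85 and an SEM of 3, the band runs from about 79 to about 91.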

Question 22

Jungkook is comparing two test scores to determine if they are significantly different. What statistical tool should he use?

Accepted Answer

Standard Error of the Difference

Answer

Standard Error of the Score

Answer

Standard Error of Estimate

Answer

Confidence Coefficient

Question 23

In a college entrance test, V (Taehyung) develops a new predictor. He notices that it accurately detects those who will succeed. Which concept is best reflected?


Accepted Answer

Sensitivity

Answer

False Negative

Answer

Confidence Interval

Answer

Specificity

Question 24

Sana was rejected for a role based on a test that predicted failure, but she later proved highly capable. What classification error was made?

Accepted Answer

False Negative

Answer

False Positive

Answer

True Positive

Answer

True Negative

Question 25

Suho analyzes his test's performance. He finds that it has high specificity. What does this tell him?

Accepted Answer

The test accurately detects those who will fail.

Answer

The test rarely misses capable candidates.

Answer

The test overestimates true scores.

Answer

The test is reliable but not valid.
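
Worked example: sensitivity and specificity computed from the four outcome counts. This sketch treats "predicted to succeed" as the positive outcome, which is an assumption that matches how the neighboring questions use the terms. Sensitivity is the share of truly capable people the test identifies, and specificity is the share of people who truly fail that the test identifies. The counts are hypothetical.

```python
def sensitivity(true_pos, false_neg):
    """Proportion of actual successes that the test predicted correctly."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg, false_pos):
    """Proportion of actual failures that the test predicted correctly."""
    return true_neg / (true_neg + false_pos)

# Hypothetical outcome counts for 100 applicants
tp, fn, tn, fp = 40, 10, 45, 5
print(sensitivity(tp, fn), specificity(tn, fp))
```

A test with many false positives has low specificity, and a test with many false negatives has low sensitivity.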

Question 26

RM is conducting an item analysis for a dynamic (unstable) construct like mood. Which reliability estimate would best suit his test?

Accepted Answer

Internal Consistency

Answer

Alternate Forms

Answer

Split-Half

Answer

Test-Retest

Question 27

Woozi wants to improve his test’s internal consistency. Which strategy should he avoid?

Accepted Answer

Use items measuring different constructs.

Answer

Use factor analysis to refine the test structure.

Answer

Increase the number of items.

Answer

Conduct item analysis.

Question 28

Lisa is studying how accurate the predicted scores are from a regression model. Which standard error should she examine?

Accepted Answer

Standard Error of the Estimate

Answer

Standard Error of the Difference

Answer

Standard Error of Measurement

Answer

Standard Error of the Mean

Question 29

In an audition, Nayeon sees that only 5 trainees were hired out of 100 applicants. What is the selection ratio?

Accepted Answer

0.05

Answer

0.95

Answer

1.5

Answer

5.0

Question 30

Jin wants to know if his test can consistently identify individuals who truly have the talent. Which concept should he evaluate?

Accepted Answer

Sensitivity

Answer

Confidence Interval

Answer

False Positive Rate

Answer

Standard Error of Measurement

Question 31

During an assessment, Seokjin obtained a raw score of 85. The test has a mean of 70 and a standard deviation of 10. What is Seokjin’s Z-score?

Accepted Answer

1.5

Answer

0.85

Answer

2.0

Answer

1.0

Question 32

An examinee has a Z-score of -1.0. Using the T-score formula, what is the equivalent T-score?

Accepted Answer

40

Answer

35

Answer

45

Answer

30
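
Worked example: the standard-score conversions used in Questions 31 and 32, sketched as two small functions. It assumes the usual definitions z = (raw − mean) / SD and T = 50 + 10z; the function names are illustrative.

```python
def z_score(raw, mean, sd):
    return (raw - mean) / sd

def t_score(z):
    # T-scores have a mean of 50 and a standard deviation of 10
    return 50 + 10 * z

print(z_score(85, 70, 10))  # 1.5
print(t_score(-1.0))        # 40.0
```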

Question 33

In a stanine system, a person who scores in the 77th percentile most likely falls into which stanine band?

Accepted Answer

Stanine 7

Answer

Stanine 4

Answer

Stanine 6

Answer

Stanine 5
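
Worked example: a sketch that maps a percentile rank to a stanine using the standard band widths of 4, 7, 12, 17, 20, 17, 12, 7, and 4 percent. The function name is hypothetical, and assigning a percentile that falls exactly on a cut to the higher band is an assumption of this sketch.

```python
from bisect import bisect_right

# Cumulative percentage at the top of stanines 1 through 8
CUTS = [4, 11, 23, 40, 60, 77, 89, 96]

def stanine(percentile):
    # Count how many cuts the percentile has reached, then shift to 1-9
    return bisect_right(CUTS, percentile) + 1

for p in (2, 50, 76, 77, 99):
    print(p, stanine(p))
```

Under that convention the 77th percentile lands in stanine 7, while the 76th falls in stanine 6.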

Question 34

What is the main purpose of computing the Standard Error of Measurement (SEM)?

Accepted Answer

To estimate how close an observed score is to the true score

Answer

To determine if two scores are statistically different

Answer

To convert raw scores to standard scores

Answer

To compare two different scales

Question 35

You computed that Jennie’s observed score is 105 and the SEM is 5. What is her 95% confidence interval?

Accepted Answer

95 to 115

Answer

102.5 to 107.5

Answer

90 to 120

Answer

100 to 110

Question 36

Which of the following statements best describes the standard error of the difference (SEdiff)?

Accepted Answer

It estimates how large a difference between two test scores must be, given their reliability, to be considered statistically significant

Answer

It predicts future performance based on a known score

Answer

It measures the error variance within a single test score

Answer

It converts raw scores into a bell curve

Question 37

A test has a reliability coefficient of 0.81, and the standard deviation of the test is 12. What is the SEM?

(Formula: SEM = SD × √(1 − r), where r is the reliability coefficient)

Accepted Answer

5.2

Answer

3.6

Answer

4.8

Answer

2.7

Answer

2.7
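
Worked example: evaluating SEM = SD × √(1 − r) with the values from this question. The function name is illustrative.

```python
from math import sqrt

def standard_error_of_measurement(sd, reliability):
    return sd * sqrt(1 - reliability)

sem = standard_error_of_measurement(12, 0.81)
print(round(sem, 1))
```

With SD = 12 and r = 0.81, the square root of 0.19 is about 0.436, so the SEM is about 5.23, or 5.2 to one decimal place.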

Question 38

Which of the following reflects a true positive in the context of psychological assessment utility?

Accepted Answer

An applicant was predicted to succeed and actually did

Answer

An applicant was predicted to fail and indeed failed

Answer

An applicant was predicted to succeed but failed

Answer

An applicant was predicted to fail but succeeded

Question 39

In a normally distributed test, what percent of scores fall between Z = -1 and Z = +1?

Accepted Answer

68%

Answer

34%

Answer

50%

Answer

95%

Question 40

You are comparing the scores of two examinees, Rosé and Jisoo, on a test with known reliability. You want to determine if their scores differ significantly. Which computation will help you most?

Accepted Answer

SE of the Difference

Answer

Confidence Interval

Answer

SE of Estimate

Answer

SEM
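
Worked example: a sketch of the standard error of the difference, SEdiff = √(SEM₁² + SEM₂²), and a simple check of whether two scores differ by more than z × SEdiff. The function names, scores, and SEM values are hypothetical; z = 1.96 corresponds to roughly the 95% level.

```python
from math import sqrt

def se_difference(sem_1, sem_2):
    return sqrt(sem_1 ** 2 + sem_2 ** 2)

def differs_significantly(score_1, score_2, sem_1, sem_2, z=1.96):
    # The gap must exceed z standard errors of the difference
    return abs(score_1 - score_2) > z * se_difference(sem_1, sem_2)

# Hypothetical: both examinees took the same test, so the SEM is the same
print(round(se_difference(3, 3), 2))
print(differs_significantly(110, 100, 3, 3))
print(differs_significantly(105, 100, 3, 3))
```

Because both scores carry measurement error, the SEdiff is larger than either SEM alone.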

Question 41

Bells, a psychometrician, is asked to design a final exam for a college-level statistics course. She includes only multiple-choice questions that cover basic definitions and excludes computational or application items. What kind of validity might be threatened in this case?

Accepted Answer

Content Validity

Answer

Construct Validity

Answer

Criterion Validity

Answer

Face Validity

Question 42

Ria develops a screening test for ADHD and finds that her tool has a high correlation with an already validated ADHD checklist. Which form of validity is she providing evidence for?

Accepted Answer

Convergent Validity

Answer

Predictive Validity

Answer

Face Validity

Answer

Discriminant Validity

Question 43

A college entrance test predicts first-year GPA very well. Which type of validity is demonstrated?

Accepted Answer

Predictive Validity

Answer

Concurrent Validity

Answer

Content Validity

Answer

Construct Validity

Question 44

A school counselor selects a personality test because students feel that it seems relevant and appropriate to their self-image, although there’s no data supporting this. What type of validity is being described?

Accepted Answer

Face Validity

Answer

Construct Validity

Answer

Content Validity

Answer

Criterion Validity

Question 45

You are asked to evaluate a new job screening tool. The HR team wants to know whether the test adds anything beyond what their interview process already tells them. What should you evaluate?

Accepted Answer

Incremental Validity

Answer

Predictive Validity

Answer

Content Validity

Answer

Construct Validity

Question 46

You’re administering a depression inventory to a group of patients, some with diagnosed depression and some without. If your tool can clearly distinguish between the two groups, which method of validation are you using?

Accepted Answer

Method of Contrasted Groups

Answer

Content Mapping

Answer

Predictive Validity

Answer

Multitrait-Multimethod Matrix

Question 47

In a COVID-19 mental health screening tool, a high number of people without anxiety are flagged as having it. This suggests the test might be weak in which area?

Accepted Answer

Specificity

Answer

Content Validity

Answer

Concurrent Validity

Answer

Sensitivity

Question 48

During test construction, the developers gather a panel of experts to rate whether each item on a stress inventory is “essential,” “useful,” or “not necessary.” What psychometric method is being used here?

Accepted Answer

Content Validity Ratio (CVR)

Answer

Multitrait-Multimethod Matrix

Answer

Incremental Validity Index

Answer

Predictive Validity Analysis
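
Worked example: a sketch of Lawshe's content validity ratio for a single item, CVR = (nₑ − N/2) / (N/2), where nₑ is the number of panelists rating the item "essential" and N is the panel size. The function name and panel counts are hypothetical.

```python
def content_validity_ratio(n_essential, n_panelists):
    half = n_panelists / 2
    return (n_essential - half) / half

# Hypothetical panel of 10 experts
print(content_validity_ratio(9, 10))   # most panelists say "essential"
print(content_validity_ratio(5, 10))   # exactly half say "essential"
```

The ratio is positive when more than half the panel rates the item essential, zero at exactly half, and negative below that.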

Question 49

A clinical tool used for diagnosing anxiety yields consistent results across time but fails to correlate with known anxiety measures. What does this suggest about the test?

Accepted Answer

It has high reliability but poor construct validity

Answer

It has low reliability and low validity

Answer

It has high construct validity but low content validity

Answer

It has good predictive validity

Question 50

A researcher creates an academic aptitude test and finds that high scorers on the full test are consistently failing one particular item. What should they do next to improve construct validity?

Accepted Answer

Remove the item due to poor item-total correlation

Answer

Administer the test to a larger sample

Answer

Conduct a discriminant validity analysis

Answer

Use KR-20 to estimate internal consistency

Question 51

A test developer reports a validity coefficient of 0.85 between a new job performance test and supervisor ratings. What does this suggest about the test?

Accepted Answer

The test has high predictive validity

Answer

The test is moderately valid

Answer

The test cannot be trusted due to bias

Answer

The test has little to no predictive ability

Question 52

Which of the following best represents an acceptable validity coefficient for a psychological test used in basic research?

Accepted Answer

0.40

Answer

0.90

Answer

0.00

Answer

0.20

Question 53

If a test shows a validity coefficient of 0.10, what can we conclude?

Accepted Answer

The test shows negligible validity

Answer

The test must be reliable

Answer

The test can be used for high-stakes decision making

Answer

The test is highly valid

Question 54

Dee administers an entrance exam with a criterion validity coefficient of 0.75 with first-year college GPA. What does this imply?

Accepted Answer

The test strongly predicts college GPA

Answer

The test cannot be used for academic placement

Answer

The test is unreliable

Answer

The test needs to be revised

Question 55

Which of the following best explains a validity coefficient of 1.00?

Accepted Answer

This is likely due to error or overfitting

Answer

The test is moderately valid

Answer

The test has zero error and is perfectly valid

Answer

It shows random association

Question 56

A test developer finds that the new scale has a construct validity coefficient of 0.50. This suggests:

Accepted Answer

Moderate support for the test measuring the intended construct

Answer

The test must be revised

Answer

Poor evidence of construct validity

Answer

Strong discriminant validity

Question 57

Which statement about validity coefficients is TRUE?

Accepted Answer

They range from 0 to 1, like reliability coefficients

Answer

They reflect the test’s ability to produce consistent scores

Answer

They are expressed in squared values

Answer

They are always higher than reliability coefficients

Question 58

Eshie wants to justify the use of a new leadership assessment tool. She should aim for a minimum validity coefficient of:

Accepted Answer

0.30

Answer

0.15

Answer

0.90

Answer

0.50

Question 59

What does a validity coefficient of 0.00 imply?

Accepted Answer

There is no relationship between the test and the criterion

Answer

The test perfectly measures the construct

Answer

The test is highly reliable

Answer

The test is highly predictive of future performance

Question 60

If a test has a validity coefficient of 0.60, what percentage of the variance in the criterion can be explained by the test?

Accepted Answer

36%

Answer

6%

Answer

96%

Answer

60%
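
Worked example: the share of criterion variance explained is the square of the validity coefficient (the coefficient of determination). A one-line sketch with an illustrative function name:

```python
def variance_explained(validity_coefficient):
    return validity_coefficient ** 2

print(f"{variance_explained(0.60):.0%}")
```

Squaring 0.60 gives 0.36, so 36% of the variance is explained and 64% is left unexplained.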
