Chapter 6: Assessing the Validity and Reliability of Measures
Class: IS303 (Tuesday) | Date: December 8, 2020 | Type: Presentation Notes, Reading Notes
Introduction
When translating mental abstractions into empirical measures, we need to consider how accurately the translation is done.
- Validity: the extent to which a measure captures what it claims to measure
- Reliability: the ability of a measure to deliver consistent or stable results
1. Measurement Validity
To test whether a variable measures what it claims to measure, the following assessments are ordered from least to most rigorous:
1.1. Face Validity
The assessment of validity by seeing if it looks right (logical) on the surface
- A measure that does not even look right on the surface lacks validity
It's the minimum a researcher should do. It is subjective: different people will assess it differently.
Ex: You cannot measure people's happiness by whether they stomp their feet.
1.2. Content Validity
Assesses whether the conceptual and nominal definitions overlap with each other:
- Conceptual Definition: theoretical clarification of concepts
- Nominal Definition: steps involved in documenting the concept
If the purpose of the question is not the same as the theoretical concept in question, it is not content valid.
How would you measure whether they relate or overlap? Is that assessment subjective?
1.3. Criterion Validity
Assessment using empirical evidence
- Predictive Validity: using empirical data/evidence to make predictions (picking the right variables); if the outcome matches the prediction, the measure is valid
- Concurrent Validity: using an existing valid measure to demonstrate the accuracy of another measure of the same variable
How would you do that? A correlation sketch follows below.
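A minimal sketch, assuming hypothetical data, of how a concurrent validity check could be run: correlate a new, unvalidated scale with an established, already validated scale answered by the same respondents at the same time. The variable names and the 0.7 rule of thumb are illustrative, not a fixed standard.

```python
# Concurrent validity sketch: correlate a new measure with an existing
# validated measure of the same variable. All data are hypothetical.
import numpy as np

new_scale = np.array([3.2, 4.1, 2.8, 4.5, 3.9, 2.5, 4.8, 3.1])          # new, unvalidated measure
established_scale = np.array([3.0, 4.3, 2.9, 4.4, 4.0, 2.7, 4.9, 3.3])  # existing valid measure

# Pearson correlation between the two measures taken from the same respondents
r = np.corrcoef(new_scale, established_scale)[0, 1]
print(f"Concurrent validity correlation: r = {r:.2f}")

# A strong positive correlation (commonly r > 0.7) suggests the new measure
# behaves like the established one, supporting concurrent validity.
```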
1.4. Construct Validity
Demonstrating accuracy by producing the same results as theoretically based hypotheses or predictions
Gathering empirical data that comes to the same result the theory predicts
Ex:
2. Reliability Check
The ability to produce or yield consistent or stable results
2.1. Test-Retest
Simplest technique
Assessing consistency by measuring the same subject twice and comparing the two results for similarities and differences (a correlation sketch follows the time-delay notes below)
Same results = reliable
The importance of time delay:
- Long Time Delay: subjects could change due to time since last test
- Short Time Delay: subjects may be able to remember the answers they picked
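A minimal test-retest sketch, assuming six hypothetical subjects answered the same questionnaire on two occasions a few weeks apart; reliability is summarized as the correlation between the two administrations.

```python
# Test-retest reliability sketch with hypothetical scores.
import numpy as np

test_scores = np.array([12, 18, 15, 20, 9, 14])     # first administration
retest_scores = np.array([13, 17, 15, 21, 10, 13])  # second administration, weeks later

# Consistency is usually reported as the correlation between the two rounds;
# values close to 1 indicate stable (reliable) results.
r = np.corrcoef(test_scores, retest_scores)[0, 1]
print(f"Test-retest reliability: r = {r:.2f}")
```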
2.2. Multiple-Forms Test
Assessing consistency by giving the same subject two forms that ask for the same information
The forms change the way the questions are asked
Asking for age:
- What is your age?
- In what year were you born?
Problem: changing the wording of questions might change the meaning or measurement of the question itself
2.3. Split-Half Test
Assessing consistency by splitting the measure into two halves and checking whether the two groups of answers correlate with the same result
Cronbach’s Alpha: the coefficient that is frequently used to report reliability
- Ranges from 0 to 1
- A value of 0.7 or above is considered reliable (a computation sketch follows below)
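A computation sketch for the split-half correlation and Cronbach's alpha, assuming a hypothetical 6-item scale answered by 5 respondents; both the data and the 0.7 cut-off mentioned in the output are illustrative.

```python
# Split-half correlation and Cronbach's alpha for a hypothetical 6-item scale.
import numpy as np

# Rows = respondents, columns = items (all values made up for illustration).
scores = np.array([
    [4, 5, 4, 4, 5, 4],
    [2, 3, 2, 2, 3, 3],
    [5, 5, 4, 5, 5, 5],
    [3, 3, 3, 2, 3, 3],
    [4, 4, 5, 4, 4, 5],
])

# Split-half: correlate the total of the odd-numbered items with the total
# of the even-numbered items.
odd_half = scores[:, ::2].sum(axis=1)
even_half = scores[:, 1::2].sum(axis=1)
split_half_r = np.corrcoef(odd_half, even_half)[0, 1]

# Cronbach's alpha = (k / (k - 1)) * (1 - sum of item variances / variance of total score)
k = scores.shape[1]
item_variances = scores.var(axis=0, ddof=1)
total_variance = scores.sum(axis=1).var(ddof=1)
alpha = (k / (k - 1)) * (1 - item_variances.sum() / total_variance)

print(f"Split-half correlation: {split_half_r:.2f}")
print(f"Cronbach's alpha: {alpha:.2f} (0.7 or above is commonly treated as reliable)")
```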
3. Measurement Errors
3.1. Noise and Bias
A measurement is an estimate of the true value.
There are many sources of measurement error; they take two forms, noise and bias.
3.2. Noise
Error that has no consistent direction or cause
Unintended error due to random chance
- With many measurements, noise cancels itself out and does not significantly distort the result (see the simulation sketch below)
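A tiny simulation, assuming a true value of 50 and zero-mean random noise, illustrating the claim that noise averages out as the number of measurements grows.

```python
# Noise cancellation sketch: the mean of many noisy measurements approaches
# the true value. Numbers are hypothetical.
import numpy as np

rng = np.random.default_rng(0)
true_value = 50.0

for n in (5, 50, 5000):
    noisy_measurements = true_value + rng.normal(loc=0.0, scale=5.0, size=n)
    print(f"n = {n:5d}  mean = {noisy_measurements.mean():.2f}")
# As n grows, the mean drifts toward the true value of 50.
```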
3.2.1. Research Subject
Noise from the subjects' limited ability to answer competently:
- Subject is too tired to provide normal, competent answers
- Subject may be too young to provide accurate answers
- Subject is inattentive, or not concentrating
3.2.2. Human Diversity
Different people answer differently; if the sample group is very diverse, the measurements will be noisy
3.2.3. Poorly Designed Questions
Poorly designed or random questions lead to random answers that do not relate to the point of the research
3.3. Bias
Error that has a pattern or consistent direction (over- or under-estimating)
Sources of Bias
- Expectation: researchers have their theory in mind and might interpret vague/ambiguous data as confirming it
- Biased Tool: the measurement instrument itself can favour one answer over another
- Threatening Condition: subjects might give the answers they think the researcher wants or expects
Reducing bias: introducing noise can help, because noise cancels itself out while bias does not
Noise is better than Bias
- Bias may be intentional or even malicious in nature, and is hard to correct
- Noise isn't intentional and cancels itself out with time and scale (a simulation sketch contrasting the two follows below)
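A simulation sketch with hypothetical numbers contrasting the two error types: zero-mean noise averages out, while a constant +3 bias shifts every measurement in the same direction and stays in the average no matter how many measurements are taken.

```python
# Noise vs. bias sketch: noise cancels out on average, bias does not.
import numpy as np

rng = np.random.default_rng(1)
true_value = 50.0
n = 10_000

noisy = true_value + rng.normal(0.0, 5.0, size=n)         # noise: no consistent direction
biased = true_value + 3.0 + rng.normal(0.0, 5.0, size=n)  # bias: consistent +3 overestimate

print(f"True value:              {true_value:.2f}")
print(f"Mean of noisy measures:  {noisy.mean():.2f}")   # close to 50
print(f"Mean of biased measures: {biased.mean():.2f}")  # stays near 53: the bias remains
```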