You have 3 free guides left 😟
Unlock your guides
You have 3 free guides left 😟
Unlock your guides

Missing data and attrition can mess up your study results big time. They're like sneaky gremlins that can make your findings less trustworthy. But don't worry, there are ways to deal with them.

You've got options like deleting incomplete cases or filling in the blanks with educated guesses. The key is picking the right method for your situation. It's all about keeping your results solid and your conclusions strong.

Missing Data Patterns and Reasons

Types and Patterns of Missing Data

Top images from around the web for Types and Patterns of Missing Data
Top images from around the web for Types and Patterns of Missing Data
  • Missing data encompasses incomplete information in datasets, while attrition involves participant loss over time in longitudinal studies
  • Common missing data patterns include:
    • (MCAR): No systematic relationship between missingness and any values
    • (MAR): Missingness related to observed variables but not unobserved ones
    • (MNAR): Missingness related to unobserved variables
  • Visualization techniques help identify missing data patterns:
    • Heat maps display missingness across variables and observations
    • Missingness patterns plots show combinations of missing variables

Causes and Implications of Missing Data

  • Reasons for missing data stem from various sources:
    • Participant non-response (survey fatigue, sensitive questions)
    • Data collection errors (equipment malfunction, human error)
    • Technical issues in data management (data corruption, software glitches)
  • Attrition occurs due to factors such as:
    • Loss of interest in the study
    • Participant relocation
    • Death or incapacitation of participants
  • Systematic patterns in missing data or attrition introduce bias and affect study validity:
    • emerges when missingness relates to unobserved characteristics influencing outcomes
    • between treatment and control groups compromises group comparability
  • Statistical tests assess missing data randomness:
    • evaluates the null hypothesis that data are MCAR
    • compare characteristics of cases with and without missing data

Handling Missing Data

Deletion Methods

  • removes all cases with any missing data:
    • Advantages include simplicity and unbiased estimates under MCAR
    • Disadvantages involve loss of statistical power and potential bias under MAR or MNAR
  • uses all available data for each analysis:
    • Benefits include preserving more data than listwise deletion
    • Drawbacks include potentially different sample sizes for different variables, complicating interpretation

Imputation Techniques

  • replaces missing values with observed value means:
    • Simple to implement but underestimates variability and distorts relationships between variables
  • predicts missing values based on other variables:
    • Preserves relationships between variables but may overstate precision
  • creates several plausible datasets, analyzes each, and combines results:
    • Accounts for uncertainty in imputed values and provides valid standard errors
    • Requires careful specification of the imputation model
  • replaces missing values with observed values from similar cases:
    • Maintains the distribution of observed data
    • Can be challenging to implement for complex datasets

Advanced Methods

  • uses all available data to estimate parameters:
    • Provides unbiased estimates under MAR and maintains statistical power
    • Can be computationally intensive for complex models
  • The choice of imputation method depends on various factors:
    • Pattern and mechanism of missingness (MCAR, MAR, MNAR)
    • Research context and specific analysis requirements
    • Available computational resources and software capabilities

Impact of Missing Data on Estimates

Validity Concerns

  • Missing data and attrition potentially threaten internal and :
    • compromised when missingness relates to treatment effects
    • External validity affected when attrition leads to a non-representative sample
  • The impact on validity varies based on:
    • Amount of missing data (percentage of incomplete cases)
    • Pattern of missingness (MCAR, MAR, MNAR)
    • Relationship between missingness and outcomes of interest
  • Differential attrition between treatment and control groups affects causal inference:
    • Introduces systematic differences between groups
    • May lead to over- or underestimation of treatment effects

Statistical Considerations

  • Power analysis assesses the impact of reduced sample size:
    • Determines if remaining sample provides sufficient statistical power
    • Guides decisions on whether to adjust sample size or analysis plans
  • Comparison of baseline characteristics between complete and incomplete cases:
    • Helps identify potential sources of bias
    • Informs the choice of appropriate missing data handling methods
  • Reporting guidelines recommend transparent documentation:
    • CONSORT for randomized trials requires detailed reporting of attrition
    • STROBE for observational studies emphasizes clear description of missing data

Sensitivity Analyses for Robustness

Imputation-Based Sensitivity Analyses

  • Multiple imputation with different predictor sets assesses result sensitivity:
    • Varies the variables included in the imputation model
    • Compares results across different imputation specifications
  • Worst-case and best-case scenario analyses bound the potential impact:
    • Imputes extreme values for missing data (minimum and maximum plausible values)
    • Provides a range of possible results under different assumptions

Model-Based Sensitivity Analyses

  • Pattern mixture models explore sensitivity to missing data mechanism assumptions:
    • Incorporates different assumptions about the distribution of missing data
    • Allows for MNAR mechanisms to be modeled explicitly
  • Tipping point analysis determines the extremity of missing data needed to change conclusions:
    • Systematically varies assumptions about missing data
    • Identifies the point at which results would change significantly

Comparative Approaches

  • Comparing results from complete case analysis with imputation methods:
    • Reveals the potential impact of different missing data handling choices
    • Helps assess the robustness of findings across methods
  • Graphical methods visually represent result sensitivity:
    • Tornado plots display the impact of different assumptions on key outcomes
    • Forest plots compare effect estimates across various sensitivity analyses
© 2024 Fiveable Inc. All rights reserved.
AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.


© 2024 Fiveable Inc. All rights reserved.
AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.

© 2024 Fiveable Inc. All rights reserved.
AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.
Glossary
Glossary