For example if 35% of your cases are females aged 2529, your 35% of your control group would be females aged 2529. How to conduct conditional cox regression for matched case control study. Chapter 17 introduces polytomous logistic regression using a case control study in which one type of case series is compared with two types of control groups. Using a comprehensive set of search criteria we identified 37 matched casecontrol studies for detailed analysis. Matched case control studies dependency within matched paircluster. A discordant pair occurs when the exposure status of case is different than the exposure status of the control. Conditional logistic regression has become a standard for matched casecontrol data to tackle the sparse data problem. Conditional logistic regression has become a standard for matched case control data to tackle the sparse data problem. Analysis of case control studies foodborne disease. If you do have a nested casecontrol study, i would be more. This module should be installed from within stata by typing ssc install cmatch.
In this matched study design, unlike in case control studies, there is. Compare matched groups on matching variables for non. Models for matched pairs stata textbook examples section 9. The data files used for the examples in this text can be downloaded in a zip file. The code below for creating the data set can be copied to the stata dofile editor and be executed through the dofile editor. Finally, conditional logistic regression is applied. Ill copy from the stata survival analysis and epidemiological tables reference manual. Stratified models for the odds ratio matched case control studies this section contains examples that illustrate the use of gmbopecan to fit stratified models for the odds ratio.
An introduction to matching and its application using sas. Teaching\stata\stata version 14\stata version 14 spring 2016\stata for categorical data analysis. Comprehensive summaries of uppsala dissertations from the faculty of social sciences 100. How to conduct conditional cox regression for matched case. History, case control methods up to modern times the sophisticated use and understanding of case control studies is the most important methodologic development of. Combining matched and unmatched control groups in case. Stratified analysis of casecontrol data in stata youtube. I have a dataset that has cases and controls matched on age, gender and number of years. In an analysis of a matched study design, only discordant pairs are used. Casecontrol study designs are used to estimate the relative risk for a disease from a specific risk factor. In a standard case control study, the investigator compares a group of controls with a group of cases. The significance of the association between a categorical risk factor and case status can be tested using mcnemars test, or by attaching a confidence interval to the odds ratio.
However, matching in a casecontrol study does not control for confounding by the matching factors. In case control studies, matching forces the ratio of cases to controls to be constant across all strata of the matched factors, which implies that the association between the confounders and the outcome is broken. Randomness in the matching is obtained using stata s runiform function. Is there any reading material which can give me an idea of doing analysis in a matched case control study. Applied logistic regression, second edition, by hosmer and lemeshow. In this paper i explore and illustrate these problems using a hypothetical pair matched case control study. Logistic regression for matched case control studies stata textbook examples the data files used for the examples in this text can be downloaded in a zip file from the wiley publications website. Aug 01, 2003 when matching is done in a case control study, those with the outcome are matched to those without the outcome on certain confounder measures. Similar results were found in another re analysis of breast cancer risk. Frequency matching is when cases and controls are matched by group, rather than by individual.
Logistic regression for matched casecontrol studies stata textbook examples. Create new dataset for matched demanders and suppliers 4. When you have used a casecontrol design for the diarrhea study, the actual crosstabulation is quite similar, only presence of diarrhea yesno, is now changed into cases and controls. There are two common misconceptions about casecontrol studies. For example, we might match the sex of the control to the sex of the case. This example adapted from intuitive biostatistics helps you use the mcnemars test calculator that is part of graphpad quickcalcs. Analysis of matched case control studies neil pearce1,2 there are two common misconceptions about case control studies. Nested casecontrol and case cohort studies an introduction and some new developments precourse.
Installation guide updates faqs documentation register stata technical services. Logistic regression for matched casecontrol studies idre stats. Analytic methods for matched case control studies include conditional logistic regression, conditioned upon the matching. The tests will performed in two statistical analysis packages, stata and. Title, the appropriate command for matched casecontrol data. The estimate is the odds ratio, which is a good estimate of the relative risk especially when the disease is rare. A propensity score is the predicted probability of an outcome. Syntax data analysis and statistical software stata. Analysis of case control studies the odds ratio or is used in case control studies to estimate the strength of the association between exposure and outcome. Regression models for casecontrol and matched studies 1 agenda quoted in breslow 1996. Statistical considerations in the analysis of matched case control studies. Those two will familiarize you with the basic commands that are commonly used in stata data management and analysis. How to find the controls from a subset of cases in an already case.
Analysis of epidemiological data using r and epicalc. The case control study is one of the most frequently used study designs in analytical epidemiology. The other basic type is a matched case control study. The idea in matching is to match upon a potential confounding variable in order to remove the confounding effect. Analysis of casecontrol studies the odds ratio or is used in case control studies to estimate the strength of the association between exposure and outcome. Stata does not limit itself to incidence rate cohort studies. Matched casecontrol studies are a classical epidemiology study design. In a matched study, we enroll controls based upon some characteristics of the case. This is clearly not the case in any 1m matched study. Openepi sample size for unmatched casecontrol studies. Again assume that sampling to case control study does only depend on.
A discussion of statistical methods for matched data. An introduction to categorical analysis by alan agresti chapter 9. Note that it is not possible to estimate the incidence of disease from a case control study unless the study is population based and all cases in a defined population are obtained. Can anyone advise me on the matched case control study. In casecontrol studies, matching forces the ratio of cases to controls to be constant across all strata of the matched factors, which implies that the association between the confounders and the outcome is broken. There is a presumption that matched data need to be analyzed by matched methods. There are two common misconceptions about case control studies. Capitalizing on the emerging consensus on the validity of the use of ulr with adjustment for matching variables for individually matched casecontrol studies, 18, 20, 22 we propose a strategy for analysis of casecontrol studies that use the concurrent design, which involves matching for time, a continuous variable. Analyze outcome variables for any significant group. This thesis focuses on some methodological aspects in the analysis of the results from this kind of study.
Case control studies are a common and efficient means. In both types of study cases and controls are sometimes matches. Teaching\ stata \ stata version 14\ stata version 14 spring 2016\ stata for categorical data analysis. When the resulting dataset is analyzed as a matched case control study, odds ratios will estimate corresponding rateratio parameters in the proportional hazards model for the cohort study. The appropriate command for matched casecontrol data. Statistical inference in matched casecontrol studies of. This study is used to investigate a cause of an illness by selecting a nonill person as the control and matching the control to a case.
Improving case control study validity by addressing drivers license sampling frame limitations. The subjects in the different case and control groups are matched on potentially. In order to perform the analysis, stata will require you to have casecontrol status coded as. This can be done using nested likelihood ratio tests as we did when. Author support program editor support program teaching with stata examples and datasets web resources training stata conferences. The primary focus here is on the analysis of matched case control studies using conditional logisticregression, but one can also fit stratified models using. Using propensity scores to reduce casecontrol selection bias.
However, matching in a case control study does not control for confounding by the matching factors. Stata has an addon program called clfit that will generate these diagnostic variables after clogit. In fact, the more standard analysis may not only be valid but may be much easier in practice, and yield bet ter statistical precision. Introduction matched case control data use of weights fixedeffects logit. This module should be installed from within stata by typing ssc install ccmatch. Matched case control studies are a classical epidemiology study design.
Individually matched case control study designs are frequently found in public health and medical literature, and conditional logistic regression is the tool most commonly used to analyze these studies. Sample size for unmatched case control studies, version 3. If you are using more than one control per case then this function also provides the reduction in sample size relative to a paired study that you can obtain using your number of controls per case dupont, 1988. An introduction to categorical analysis by alan agresti. Using a comprehensive set of search criteria we identified 37 matched case control studies for detailed analysis. Stratified models for the odds ratio matched case control. Multivariate analysis and hypothesis testing clogit. Sample size for matched casecontrol studies statsdirect. History, casecontrol methods up to modern times the sophisticated use and understanding of casecontrol studies is the most important methodologic development of unmatched cc study modern epidemiology rothman textbook 1986, p. A discussion of statistical methods for matched data analysisfor matched data analysis mingfu liu. For the love of physics walter lewin may 16, 2011 duration. If the matching is exact, accounting for the matching in the analysis will. Casecontrol studies can usually be conducted relatively faster and are inexpensive particularly when compared with cohort studies. Parsons, ovation research group, seattle, washington abstract a case control matched analysis is often used in observational studies to reduce selection bias and approximate a randomize d trial.
A population based case control study was conducted in northern norway and central. Applied logistic regression, second edition, by hosmer and lemeshow chapter 7. Crosstabulation in cohort and casecontrol studies the. The sparse data problem, however, may not be a concern for loose. Power analysis for matched casecontrol studies stata. A discussion of statistical methods for matched data analysis. Matching is often used in casecontrol control studies to ensure that the cases and controls are similar in certain characteristics, and it is a useful technique to increase the efficiency of the study. Stata module to match cases and controls using specified variables, statistical software components s457372, boston college department of economics, revised 27 jan 2015. When each case is matched to one control, we say that the study is 1. Statistical considerations in the analysis of matched case. Matched cohort methods for injury research epidemiologic. Matching is intended to eliminate confounding, however, the main potential benefit of matching in case control studies is a gain in efficiency.
Pdf elementary epidemiological data analysis using stata. However, matching in a case control study does not control for. Phd, dcs methodology journal club presentation oct 5, 2010oct 5, 2010 1. Matching on demographic variables is commonly used in casecontrol studies to adjust for confounding at the design stage. Analysis of frequency matched case control studies. Matching on demographic variables is commonly used in case control studies to adjust for confounding at the design stage.
Logistic regression for matched casecontrol studies. The matched pair case control study calculates the statistical relationship between exposures and the likelihood of becoming ill in a given patient population. Feb 25, 2016 however, there is a small increase in precision in the matched case control study compared with the unmatched case control studies 95% confidence intervals of 1. This means that for every case there is a control who has the same or closely similar values of the matching variables.
Frontiers unconditional or conditional logistic regression. This happens in epidemiological case control studies, where a possible risk factor is compared between cases and controls to investigate the nature of the disease. Jan 19, 2016 case control study o it is an observational study in which subjects are sampled based upon presence or absence of disease and then their prior exposure status is determined. Matching on factors such as age and sex is commonly used in casecontrol studies. Combining matched and unmatched control groups in case control studies article pdf available in american journal of epidemiology 16810. You might see tables that are flipped the layout of tables here is the following. Both exposure and outcome disease have occurred before the start of the study. Epi info provides interactive analysis of surveillance system data. Strategy for data analysis for casecontrol studies describe study profile. The 2,988 breast cancer cases were linked to the drivers license file to determine whether cases matched a record from the masterfile of drivers. Analysis of matched cohort data is not discussed in many textbooks or articles and is not mentioned in the stata manuals. Currently, sas does not offer the diagnostics discussed in this section.
1053 475 11 409 1242 1069 1408 978 200 923 166 794 1360 1327 1061 1062 236 1352 1225 158 788 764 1077 322 1265 712 1299 419 382 1332 319 959 639 308 1424 1015 60 402 1096 351 833 1097 667 213 135 1006 142