All code blocks can be copied by clicking the clipboard icon in the upper right corner. If the icon is hidden, hovering your mouse cursor in the area should reveal it.
Scrolling Content
In come cases, content may wrap beyond the limits of the slide. It may be necessary to scroll up/down or left/right.
Some Caveats
Let’s Keep it Simple, Silly
My goal is to focus largely on conceptual issues related to SLCMA and studies of DNA methylation. Although I briefly discuss the slcma R Package created by Dr. Andrew Smith, this is not meant to be a full tutorial.
I’m Just a Padawan
Dr. Andrew Smith is a Jedi Master. I learned everything I know about SLCMA from him. Some materials have been borrowed/adapted from his teaching. I am grateful and humbled by the opportunity to learn from him.
The Importance of Social Science
DNAm Research
Limited Focus in Biology
Current Exposure
Ever Exposed
Contribution of Social Science
Several large panel studies
Many years of data
A plethora of data types
Really interesting questions/theory
What We Need
Way to use our vast data to test hypotheses across the life course
Systematic to avoid false-positive results
Efficient to accommodate analysis of high-dimensional data
Easy to use
SLCMA can help!
What is SLCMA?
Structured Life Course Modeling Approach
Which life course hypothesis best fits our data?
SLCMA Hypotheses
Big 3
Sensitive Periods
Accumulation
Recency
Others
Mobility
Change
Always Exposed
Ever Exposed
Hypotheses (Big 3)
Sensitive Periods \(\left(SP \text{ at } t_j\right)\)
The developmental timing of an exposure has the strongest effect on the outcome at a specific time point due to heightened levels of plasticity or reprogramming
Just the exposure variable at each age
Can be continuous or binary
\[SP_j = x_j\]
Hypotheses (Big 3)
Accumulation \(\left(Acc\right)\)
Every additional time point of exposure affects the outcome in a dose-response manner, independent of the exposure timing
Add up the exposure variable across
Can be continuous or binary
\[Acc = \sum_{j=1}^m{x_j}\]
Hypotheses (Big 3)
Recency \(\left(Rec\right)\)
More proximal exposures (closer in time to the of the outcome) are more strongly linked to the outcome than are more distal exposures
Add up the products of each exposure variable multiplied by its age of observation
Can be continuous or binary
\[Rec = \sum_{j=1}^m{\left(x_jt_j\right)}\]
SLCMA Steps
Fit a regression model for each single life course hypothesis of interest, as well as groups of compound hypotheses
Measure the goodness-of-fit of each model and select the best one
Calculate appropriate p-values for the selected model
Term
(Intercept)
sp04
sp26
sp43
Accumulation(sp04, sp26, sp43)
Recency(weights = c(4, 26, 43), sp04, sp26, sp43)
Role
Adjusted for in all models
Available for variable selection
Available for variable selection
Available for variable selection
Available for variable selection
Available for variable selection
Inference for model at Step 1 of LARS procedure
Number of selected variables: 1
R-squared from lasso fit: 0.022
Results from fixed lasso inference (selective inference):
Standard deviation of noise (specified or estimated) sigma = 5.210
Testing results at lambda = 18.578, with alpha = 0.050
Coef P-value CI.lo CI.up LoTailArea
Accumulation(sp04, sp26, sp43) -0.964 0 -1.272 -0.63 0.024
UpTailArea
Accumulation(sp04, sp26, sp43) 0.024