Scottish Study of Early Learning and Childcare: ELC Leavers (Phase 2) Report - Updated 2021

Findings from the second phase of the Scottish Study of Early Learning and Childcare (SSELC), a research project established to evaluate the expansion of early learning and childcare (ELC) in Scotland.

This document is part of a collection


Appendix D – Regression analysis

Tables D1 and D2 show the results of logistic regression analysis of whether a child has delayed development on at least two domains of the Ages and Stages Questionnaire and of raised / high score on the Strengths and Difficulties Questionnaire total difficulties scale.

Logistic regression analysis is a method of summarising the relationship between a binary 'outcome' variable and one or more 'predictor' variables. It allows us to estimate the odds of a child having a score of '1' on the outcome variable (as opposed to '0') from knowledge of their scores on the predictor variables. In the model shown in Table D1 the score of '1' on the dependent variable refers to exhibiting delayed development on two or more of the ASQ domains, while a '0' refers to exhibiting no delayed development, or delayed development on just one of the domains.

Logistic regression allows us to consider multiple relationships at the same time and to identify those relationships between a predictor variable and the outcome variable which remain statistically significant even when we take into account other predictor variables. For those variables that do remain significant we can say that they show an independent association with the outcome variable while controlling all other factors in the model.

Tables D1 and D2 show how the odds for each category of each predictor variable compare with the odds for the reference category. An odds ratio of greater than 1 indicates that, holding all other factors constant, there is an increased likelihood of a child in that category being in the category '1' for the outcome variable compared with a child in the base category. For example, in Table D1, the odds ratio of 4.10 for the category 'Male' means that boys are more likely than girls (the base category) to exhibit delayed development on two or more of the ASQ domains (and the odds of a boy exhibiting delayed development are 4.09 times those for a girl, holding all other factors constant). Conversely, an odds ratio of below 1 means they have lower odds of exhibiting delayed development than respondents in the reference category.

Because data are taken from a sample, we recognise that the odds ratios are only estimates, so we also include confidence intervals around each estimate. If the survey were to be repeated, we would expect the true value to fall within these odds ratios 95 times out of 100.

Two measures of statistical significance are provided. The first is for the comparison between a particular category and the base category, while the second is for the variable as a whole. Where the independent variable has just two categories, these are the same. A significance level of 0.05 or less indicates that there is less than a 5% chance we would have found these differences between the categories just by chance if in fact no such difference exists, hence we can say that we are 95% sure there is a relationship between the predictor and outcome variables. A level of <0.001 indicates that there is a less than 0.1% chance, so we can say that we are 99.9% sure that the relationship exists. For the purposes of Tables 9 and 10, we described a level of significance of less than 0.01 as "highly significant", of between 0.01 and 0.05 as "moderately significant, and of between 0.05 and 0.10 as "marginally significant".

The Nagelkerke R-square value provided at the bottom of each model is a rough indication of the proportion of variation in the outcome variable explained by the predictor variables in the model. In each of the models this is between 0.2 and 0.26, which is fairly typical for this type of analysis. This means that there is a lot of variation in the data which is not explained by the variables (and nor would we expect it to be).

Both models have been tested for stability through the systematic removal of variables to check for changes in odds ratios and significance of other variables, and checks on the covariation of independent variables, and both were found to be stable. The variable for frequency of sleeping through the night was not included in the models because of its strong correlation with other variables in the model, which would have affected the overall stability.

Table D1: Logistic regression model of delayed development on at least two domains of the Ages and Stages Questionnaire
  Odds Ratio Confidence interval Sig. (compared with base) Sig. (overall)
Sex of child      

<0.001

Male

4.10

(2.64 - 6.37)

<0.001

 
Female        
Long term health condition (reported by either parent or keyworker)      

<0.001

Yes

4.78

(2.99 - 7.64)

<0.001

 
No (+missing)        
Parental long-term condition      

0.352

Yes

0.76

(0.42 - 1.36)

0.352

 
No (+ missing)        
Single / couple parent household      

0.533

Single parent household

1.17

(0.72 - 1.91)

0.533

 
Couple parent household (+ missing)        
Number of siblings < 16 in household      

0.036

None / One (+ missing)

0.60

(0.37 - 0.97)

0.036

 
Two or more        
Age of parent / respondent      

0.639

20 to 29

1.34

(0.64 - 2.81)

0.439

 
30 to 34

1.43

(0.75 - 2.74)

0.272

 
35 to 39 (+ missing)

1.53

(0.79 - 2.94)

0.204

 
40+        
Language spoken at home      

0.034

Not English / dual language

2.08

(1.06 - 4.08)

0.034

 
English (+ missing)        
Area deprivation (SIMD) of home address      

0.295

40% most deprived

1.27

(0.81 - 2.01)

0.295

 
Less deprived (+ missing)        
Equivalised household incrome      

0.758

Lowest 40%

1.08

(0.66 - 1.75)

0.758

 
Middle / higher incomes (+ missing)        
Whether respondent in work / training      

0.166

No

1.45

(0.86 - 2.44)

0.166

 
Yes (+ missing)        
Highest qualification of respondent      

0.009

None / lower school (Standard Grade, etc.)

2.16

(1.32 - 3.53)

0.002

 
Upper school / post-school/pre-HE (Highers, HNC, etc.) (+ missing)

1.23

(0.75 - 2.03)

0.412

 
Degree / HE        
Parental mental wellbeing (SWEMWBS)      

0.155

Low (1 s.d below the mean)

1.50

(0.86 - 2.64)

0.155

 
Average / high (+ missing)        
Feelings about amount of support from friends / family outside of household      

0.256

Don't get enough / don't get any

1.38

(0.79 - 2.43)

0.256

 
Get enough / don't need (+ missing)        
Parental self-efficacy      

0.285

Coping well as a parent less often

0.74

(0.43 - 1.28)

0.285

 
Coping well as a parent most or all of the time (+ missing)        
How many hours sleep per 24 hours      

0.229

< 10 hours

1.70

(0.92 - 3.16)

0.090

 
10 hours

1.05

(0.65 - 1.70)

0.839

 
11+ hours (+ missing)        
Ever breastfed      

0.192

Yes (+ missing)

0.75

(0.48 - 1.16)

0.192

 
No        
Home learning environment scale      

0.881

Lowest quartile (least frequent activities)

0.97

(0.62 - 1.51)

0.881

 
Higher quartiles (+ missing)        
Confusion, hubbub and order scale (CHAOS)      

0.207

Highest tertile (most chaotic)

1.67

(0.95 - 2.94)

0.076

 
Middle tertile (+ missing)

1.42

(0.81 - 2.51)

0.223

 
Lowest tertile (least chaotic)        
Total hours of childcare (formal and informal)      

0.247

> 30

0.78

(0.43 - 1.40)

0.394

 
> 18, up to 30

0.60

(0.33 - 1.10)

0.099

 
Up to 18        
Intercept       <0.001
 

0.03

(0.01 - 0.08)

<0.001

 
n = 1301        
Nagelkerke R square = 0.256        
Table D2: Logistic regression model of raised / high score on the Strengths and Difficulties Questionnaire total difficulties scale
  Odds Ratio Confidence interval Sig. (compared with base) Sig. (overall)
Sex of child      

<0.001

Male

2.37

(1.47 - 3.82)

<0.001

 
Female        
Long term health condition (reported by either parent or keyworker)  

 

 

<0.001

Yes

3.59

(2.27 - 5.67)

<0.001

 
No (+missing)        
Parental long-term condition      

0.900

Yes

1.03

(0.63 - 1.70)

0.900

 
No (+ missing)        
Single / couple parent household      

0.520

Single parent household

1.22

(0.67 - 2.22)

0.520

 
Couple parent household (+ missing)        
Number of siblings < 16 in household      

0.340

None / One (+ missing)

1.30

(0.76 - 2.20)

0.340

 
Two or more        
Age of parent / respondent      

0.323

20 to 29

0.91

(0.46 - 1.78)

0.776

 
30 to 34

1.16

(0.72 - 1.88)

0.536

 
35 to 39 (+ missing)

0.74

(0.40 - 1.37)

0.332

 
40+        
Language spoken at home      

0.646

Not English / dual language

0.84

(0.40 - 1.78)

0.646

 
English (+ missing)        
Area deprivation (SIMD) of home address      

0.558

40% most deprived

0.86

(0.52 - 1.42)

0.558

 
Less deprived (+ missing)        
Equivalised household incrome      

0.119

Lowest 40%

1.53

(0.90 - 2.60)

0.119

 
Middle / higher incomes (+ missing)        
Whether respondent in work / training      

0.187

No

1.39

(0.85 - 2.25)

0.187

 
Yes (+ missing)        
Highest qualification of respondent      

0.974

None / lower school (Standard Grade, etc.)

1.05

(0.59 - 1.85)

0.877

 
Upper school / post-school/pre-HE (Highers, HNC, etc.) (+ missing)

1.06

(0.64 - 1.76)

0.821

 
Degree / HE        
Parental mental wellbeing (SWEMWBS)      

0.001

Low (1 s.d below the mean)

2.30

(1.38 - 3.83)

0.001

 
Average / high (+ missing)        
Feelings about amount of support from friends / family outside of household      

0.451

Don't get enough / don't get any

0.81

(0.46 - 1.41)

0.451

 
Get enough / don't need (+ missing)        
Parental self-efficacy      

0.181

Coping well as a parent less often

1.38

(0.86 - 2.22)

0.181

 
Coping well as a parent most or all of the time (+ missing)        
How many hours sleep per 24 hours      

0.341

< 10 hours

1.49

(0.83 - 2.68)

0.178

 
10 hours

0.92

(0.57 - 1.50)

0.742

 
11+ hours (+ missing)        
Ever breastfed      

0.271

Yes (+ missing)

0.77

(0.49 - 1.23)

0.271

 
No        
Home learning environment scale      

0.112

Lowest quartile (least frequent activities)

1.41

(0.92 - 2.17)

0.112

 
Higher quartiles (+ missing)        
Confusion, hubbub and order scale (CHAOS)      

0.263

Highest tertile (most chaotic)

1.53

(0.89 - 2.63)

0.122

 
Middle tertile (+ missing)

1.51

(0.86 - 2.64)

0.152

 
Lowest tertile (least chaotic)        
Total hours of childcare (formal and informal)      

0.348

> 30

0.97

(0.57 - 1.63)

0.892

 
> 18, up to 30

0.65

(0.36 - 1.18)

0.152

 
Up to 18        
Intercept       <0.001
 

0.03

(0.01 - 0.08)

<0.001

 
n = 1313        
Nagelkerke R square = 0.214        

How to access background or source data

The data collected for this social research publication:

☐ are available in more detail through Scottish Neighbourhood Statistics

☐ are available via an alternative route

☒ may be made available on request, subject to consideration of legal and ethical factors. Please contact socialresearch@gov.scot for further information.

☐ cannot be made available by Scottish Government for further analysis as Scottish Government is not the data controller.

Contact

Email: socialresearch@gov.scot

Back to top