R package to run estimations with weighted replicate samples and plausible values
Rrepest
estimates statistics using replicate weights (balanced repeated replication/brr weights, jackknife replicate weights,…), thus accounting for complex survey designs in the estimation of sampling variances. It is especially designed for use with the international education datasets produced by the OECD and the IEA, and also allows for analyses with multiply imputed variables (plausible values); where plausible values are used, the average estimator across plausible values is reported and the imputation error is added to the variance estimator.
A “cheat sheet” including an overview of the syntax and uses of Rrepest
is available here
Run the following code:
Download Rrepest, then run
Run the following code replacing “You_R_Name” with your username:
Run:
Run the following code replacing “MY_TOKEN” with your gitlab token:
Note: It will take a few minutes to install
Run:
Note: Ensure you have the package data.table
installed. For a complete list of the dependencies used, consult the Description file.
The current version supports uni-variate statistics (e.g. mean, variance, standard deviation, quantiles), frequencies, linear regression and covariance.
# PISA 2018 Data
# df.qqq <- readRDS("//oecdmain/asgenedu/EDUCATION_DATALAKE/sources/PISA/PISA 2018/R/STU/CY07_MSU_STU_QQQ.rds")
Rrepest::Rrepest(data = df.qqq,
svy = "PISA2015",
est = est(c("mean","var","quant",0.5,"iqr",c(.9,.1)),"CNTSCHID"),
by = c("cnt"))
# TALIS 2018 Data
# df.t <- readRDS(file = "V:/TALIS/BACKUP/DATA/TALIS2018/R/PUF/TTGINTT3_demo.rds")
Rrepest::Rrepest(data = df.t,
svy = "TALISTCH",
est = est("freq","TT3G06I2"),
over = c("TT3G06A2","TT3G52J"),
by = "cntry",
test = T,
isced = 2)
# TALIS 2018 Data
# df.t <- readRDS(file = "V:/TALIS/BACKUP/DATA/TALIS2018/R/PUF/TTGINTT3_demo.rds")
df.t <- df.t %>%
mutate(TT3G01_rec = case_when(TT3G01 == 2 ~ 1,
TT3G01 == 1 ~ 0))
Rrepest::Rrepest(data = df.t,
svy = "TALISTCH",
est = est("lm","TT3G01_rec",'TT3G39C'),
by = "cntry")
# PISA 2018 Data
# df.qqq <- readRDS("//oecdmain/asgenedu/EDUCATION_DATALAKE/sources/PISA/PISA 2018/R/STU/CY07_MSU_STU_QQQ.rds")
Rrepest::Rrepest(data = df.qqq,
svy = "PISA2015",
est = est("corr",c("pv@math","pv@read")),
by = c("CNT"))
Further examples can be found in the Examples.R file
To incorporate analyses that are not pre-programmed into Rrepest, you can utilize the ‘gen’ option within the est()
function of Rrepest. More about it is presented in the following wiki.
Francesco Avvisati, Rodolfo Ilizaliturri and François Keslair.
Contact us if you want to join!
Do you have suggestions or comments? Please open an issue
First public release (30 June 2023).