220306_130626 iffy misinfome retweet analyses

- [iffy domains](https://docs.google.com/spreadsheets/d/1ck1_FZC-97uDLIlvRJDTrGqBk0FuDe9yHkluROgpGS8/edit#gid=1144285784) - analyses below are for **retweets**, Oct 17 to Oct 24 Iffy misinfome domain ratings ![[s20220310_175335.png]] # Count analyses (quasi-poisson) Model: `count_time1 ~ condition[-0.5/0.5] * count_time0` - `count_time0LC`: `log(count_time0 + 1)` ```r # OLS m <- glm(count_t1 ~ conditionC * count_t0LC, dt1[domain_type == "overall"], family = "quasipoisson") summary(m) Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) 0.8742605 0.0273824 31.928 <2e-16 *** conditionC 0.0185836 0.0547648 0.339 0.734 count_t0LC 0.8431191 0.0164322 51.309 <2e-16 *** conditionC:count_t0LC -0.0007253 0.0328643 -0.022 0.982 # robust SEs z test of coefficients: Estimate Std. Error z value Pr(>|z|) (Intercept) 0.87426049 0.02743417 31.8676 <2e-16 *** conditionC 0.01858362 0.05486833 0.3387 0.7348 count_t0LC 0.84311912 0.01723380 48.9224 <2e-16 *** conditionC:count_t0LC -0.00072532 0.03446761 -0.0210 0.9832 # account for blocking and robust SE > mb <- feglm(count_t1 ~ conditionC * count_t0LC | block, dt1[domain_type == "overall"], family = 'quasipoisson') NOTE: 667 fixed-effects (3,757 observations) removed because of only 0 outcomes. > summary(mb, vcov = "HC1") GLM estimation, family = quasipoisson, Dep. Var.: count_t1 Observations: 29,131 Fixed-effects: block: 4,757 Standard-errors: Heteroskedasticity-robust Estimate Std. Error t value Pr(>|t|) conditionC 0.013074 0.045290 0.288674 0.77283 count_t0LC 0.351705 0.028403 12.382643 < 2.2e-16 *** conditionC:count_t0LC 0.009051 0.027902 0.324405 0.74563 --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 Squared Cor.: 0.414064 ``` # Continuous ratings analysis - only bad domains (hence quality upper bound is 30) - see also [[220301_145044 user ECDFs#iffy misinfome 471 bad domains|iffy misinfome user ECDFs]] Treatment condition shared somewhat higher quality bad stuff? ```r # account for blocking and robust SE > m1 <- feols(mean_t1 ~ conditionC * mean_t0C | block, dt1[domain_type == "overall"]) > summary(m1, vcov = "HC1") OLS estimation, Dep. Var.: mean_t1 Observations: 32,888 Fixed-effects: block: 5,424 Standard-errors: Heteroskedasticity-robust Estimate Std. Error t value Pr(>|t|) conditionC 0.063683 0.035724 1.782611 0.074661 . # treatment quality increased? mean_t0C 0.424388 0.007206 58.889661 < 2.2e-16 *** conditionC:mean_t0C 0.001111 0.012414 0.089486 0.928696 ``` # 3-bins analysis No interactions ```r # bin mean and size > dt1[, .(mean_t0 = mean(mean_t0), n = .N), keyby = .(bin = mean_t0_bin)] bin mean_t0 n 1: _1 3.181468 10981 2: _2 6.739604 14248 3: _3 12.928745 7659 > m3.1 <- feols(mean_t1 ~ conditionC * mean_t0_bin | block, dt1[domain_type == "overall"], vcov = "HC1") > m3.1 OLS estimation, Dep. Var.: mean_t1 Observations: 32,888 Fixed-effects: block: 5,424 Standard-errors: Heteroskedasticity-robust Estimate Std. Error t value Pr(>|t|) conditionC 0.095950 0.060811 1.577822 0.11462 mean_t0_bin_2 1.133373 0.051965 21.810400 < 2.2e-16 *** mean_t0_bin_3 3.734485 0.070092 53.279900 < 2.2e-16 *** conditionC:mean_t0_bin_2 -0.084701 0.078398 -1.080399 0.27997 conditionC:mean_t0_bin_3 0.027759 0.127958 0.216943 0.82825 # model comparisons (w/o condition) > m101 <- feols(mean_t1 ~ conditionC * mean_t0_bin | block, dt1[domain_type == "overall"]) > m102 <- feols(mean_t1 ~ mean_t0_bin | block, dt1[domain_type == "overall"]) > test_wald(m102, m101) Name | Model | df | df_diff | F | p ---------------------------------------------- m102 | fixest | 32886 | | | m101 | fixest | 32883 | 3.00 | 1.88 | 0.130 # no effect ``` # Exploratory stuff with other predictors - what happens if we also add retweet count during time0 and time1 in the models predicting mean quality at time1? ```r # predictors: condition, qualityt0, count0(log), countt1(log) > m <- feols(mean_t1 ~ conditionC * mean_t0C + t1_retweetLC + t0_retweetLC | block, dt2[domain_type == "overall"]) > summary(m, vcov = "HC1") OLS estimation, Dep. Var.: mean_t1 Observations: 32,888 Fixed-effects: block: 5,424 Standard-errors: Heteroskedasticity-robust Estimate Std. Error t value Pr(>|t|) conditionC 0.065554 0.035654 1.838597 6.5985e-02 . mean_t0C 0.425248 0.007173 59.284398 < 2.2e-16 *** t1_retweetLC 0.109740 0.013510 8.122573 4.7539e-16 *** t0_retweetLC -0.121054 0.015206 -7.961186 1.7694e-15 *** conditionC:mean_t0C 0.000786 0.012367 0.063561 9.4932e-01 --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 RMSE: 2.93587 Adj. R2: 0.285441 Within R2: 0.19053 # interactions with condition > m <- feols(mean_t1 ~ conditionC * (mean_t0C + t1_retweetLC + t0_retweetLC) | block, dt2[domain_type == "overall"]) > summary(m, vcov = "HC1") OLS estimation, Dep. Var.: mean_t1 Observations: 32,888 Fixed-effects: block: 5,424 Standard-errors: Heteroskedasticity-robust Estimate Std. Error t value Pr(>|t|) conditionC 0.065544 0.035664 1.837814 6.6101e-02 . mean_t0C 0.425252 0.007174 59.279970 < 2.2e-16 *** t1_retweetLC 0.109689 0.013510 8.119062 4.8931e-16 *** t0_retweetLC -0.120971 0.015201 -7.958299 1.8111e-15 *** conditionC:mean_t0C 0.000779 0.012361 0.063026 9.4975e-01 conditionC:t1_retweetLC -0.005827 0.023615 -0.246769 8.0509e-01 conditionC:t0_retweetLC 0.005363 0.028757 0.186492 8.5206e-01 --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 RMSE: 2.93586 Adj. R2: 0.285391 Within R2: 0.190533 # all interactions > m <- feols(mean_t1 ~ conditionC * mean_t0C * t0_retweetLC * t1_retweetLC | block, dt2[domain_type == "overall"]) > summary(m, vcov = "HC1") OLS estimation, Dep. Var.: mean_t1 Observations: 32,888 Fixed-effects: block: 5,424 Standard-errors: Heteroskedasticity-robust Estimate Std. Error t value Pr(>|t|) conditionC 0.053694 0.037255 1.441248 1.4953e-01 mean_t0C 0.434035 0.008971 48.381519 < 2.2e-16 *** t0_retweetLC -0.110281 0.014052 -7.848166 4.3747e-15 *** t1_retweetLC 0.072430 0.012787 5.664168 1.4922e-08 *** conditionC:mean_t0C -0.030011 0.016387 -1.831423 6.7048e-02 . conditionC:t0_retweetLC 0.003427 0.026501 0.129329 8.9710e-01 mean_t0C:t0_retweetLC -0.035603 0.007491 -4.752873 2.0156e-06 *** conditionC:t1_retweetLC 0.000317 0.022498 0.014071 9.8877e-01 mean_t0C:t1_retweetLC 0.108930 0.004474 24.347782 < 2.2e-16 *** t0_retweetLC:t1_retweetLC -0.011408 0.003744 -3.047403 2.3105e-03 ** conditionC:mean_t0C:t0_retweetLC 0.026281 0.014571 1.803740 7.1283e-02 . conditionC:mean_t0C:t1_retweetLC -0.009928 0.008801 -1.128131 2.5927e-01 conditionC:t0_retweetLC:t1_retweetLC 0.000345 0.007288 0.047332 9.6225e-01 mean_t0C:t0_retweetLC:t1_retweetLC 0.002320 0.002956 0.784844 4.3255e-01 conditionC:mean_t0C:t0_retweetLC:t1_retweetLC 0.014693 0.005760 2.550745 1.0755e-02 * --- Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 RMSE: 2.88318 Adj. R2: 0.310605 Within R2: 0.219322 ```