> summ(feols(accuracy ~ conditionC * modelF * strategy, d1, cluster = ~responseid))
    term                                              result                              sig
    <char>                                            <char>                              <char>
 1: (Intercept)                                       b = 63.57 [62.29, 64.85], p < .001  ***     # baseline, gpt
 2: conditionC                                        b = 0.90 [-1.66, 3.46], p = .490            # no difference in accuracy between conditions
 3: modelFDeepSeek-V3                                 b = -3.40 [-5.28, -1.51], p < .001  ***     # deepseek is less accurate than gpt
 4: modelFLlama-4                                     b = 6.32 [4.52, 8.13], p < .001     ***     # llama is more accurate than gpt
 5: strategynofacts                                   b = 4.45 [-0.36, 9.26], p = .070    .       # nofacts has far fewer messages than baseline
 6: conditionC × modelFDeepSeek-V3                    b = 0.07 [-3.69, 3.83], p = .971
 7: conditionC × modelFLlama-4                        b = 0.81 [-2.79, 4.42], p = .658
 8: conditionC × strategynofacts                      b = -8.09 [-17.72, 1.54], p = .099  .
 9: modelFDeepSeek-V3 × strategynofacts               b = 8.67 [-0.84, 18.17], p = .074   .
10: modelFLlama-4 × strategynofacts                   b = -5.70 [-11.55, 0.15], p = .056  .
11: conditionC × modelFDeepSeek-V3 × strategynofacts  b = 1.59 [-17.42, 20.59], p = .870
12: conditionC × modelFLlama-4 × strategynofacts      b = 3.56 [-8.13, 15.26], p = .550
```

![[1747066479.png]]