Jones, C. R. & Bergen, B. (under review). People cannot distinguish GPT-4 from a human in a Turing test. [preprint]
Jones, C. R., Bergen, B., & Trott, S. (2024). Do Multimodal Large Language Models and Humans Ground Language Similarly? Computational Linguistics [paper]
Jones, C. R., Trott, S., & Bergen, B. (to appear). Comparing Humans and Large Language Models on an Experimental Protocol Inventory for Theory of Mind Evaluation (EPITOME). Transactions of the Association for Computational Linguistics [paper]
Jones, C. R. & Trott, S. (2024). Multimodal Language Models Show Evidence of Embodied Simulation. LREC-COLING 2024 [paper]
Jones, C. R. & Bergen, B. (2024). Does GPT-4 Pass the Turing Test? NAACL 2024 [paper]
Jones, C. R. & Bergen, B. (2024). Does word knowledge account for the effect of world knowledge on pronoun interpretation? Language and Cognition [paper]
Trott, S.*, Jones, C. R.*, Michaelov, J. A., Chang, T. A., & Bergen, B. (2023). Do Large Language Models know what humans know? Cognitive Science, 47(7). [*co-first author] [paper]
Jones, C. R., Chang, T. A., Coulson, S., Michaelov, J. A., Trott, S., & Bergen, B. (2022). Distributional Semantics Still Can't Account for Affordances. In Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 44, No. 44). [paper]
Jones, C. R. & Bergen, B. (2021). The Role of Physical Inference in Pronoun Resolution. In Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 43, No. 43). [paper]
Binder, F. J.*, Jones, C. R.*, Kaufman, R. A., Lin, N. T., Poole, C. R., & Vul, E. (2021). Cognitive cost and information gain trade-off in a large-scale number guessing game. In Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 43, No. 43). [*co-first author] [paper]
Jones, C. & Kirby, S. (2018). The Effect of Biasing Information on a Transmission Chain of Short Texts. 2nd Conference of the Cultural Evolution Society, Arizona, USA.