Prompt Engineering ChatGPT for Codenames

Matthew Sidji, Matthew Stephenson

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

The word association game Codenames challenges the AI community with its requirements for multimodal language understanding, theory of mind, and epistemic reasoning. Previous attempts to develop AI agents for the game have focused on word embedding techniques, which while good with other models using the same technique, can sometimes suffer from brittle performance when paired with other models. Recently, Large Language Models (LLMs) have demonstrated enhanced capabilities, excelling in complex cognitive tasks, including symbolic and common sense reasoning. In this paper, we compare a range of recent prompt engineering techniques for GPT-based Codenames agents. While there was no significant game score improvement over the baseline agent, we did observe qualitative changes in agents' strategies suggesting that further refinement has potential for score improvement. We also propose a revised Codenames AI competition specifically focusing on the use of LLM agents.

Original languageEnglish
Title of host publicationProceedings of the 2024 IEEE Conference on Games (CoG)
Place of PublicationUnited States
PublisherInstitute of Electrical and Electronics Engineers
Number of pages4
ISBN (Electronic)979-8-3503-5067-8
ISBN (Print)979-8-3503-5068-5
DOIs
Publication statusPublished - 28 Aug 2024
Event6th Annual IEEE Conference on Games - Milan, Italy
Duration: 5 Aug 20248 Aug 2024

Publication series

NameIEEE Conference on Games, CoG
ISSN (Print)2325-4270
ISSN (Electronic)2325-4289

Conference

Conference6th Annual IEEE Conference on Games
Abbreviated titleCoG 2024
Country/TerritoryItaly
CityMilan
Period5/08/248/08/24

Keywords

  • AI
  • ChatGPT
  • Codenames
  • Game Playing Agents
  • Prompt Engineering

Fingerprint

Dive into the research topics of 'Prompt Engineering ChatGPT for Codenames'. Together they form a unique fingerprint.

Cite this