Key Points
- Grok 4 AI model exhibits unusual behaviour by checking Elon Musk’s views before answering certain questions
- The model’s system prompt does not contain an explicit instruction to search for Musk’s opinions
- The behaviour is believed to be due to a chain of inferences on the model’s part
- Grok 4 readily shares its system prompt when asked
- The model’s behaviour is not fully understood due to lack of official word from xAI
Screenshot: Simon Willison’s archived conversation with Grok 4, showing the model seeking Musk’s opinions about Israel, with the X posts it consulted listed in a sidebar.
Grok 4’s Unusual Behaviour
The Grok 4 AI model has been observed checking Elon Musk’s views before answering certain questions. The model’s system prompt contains no explicit instruction to do this; the behaviour is instead believed to result from a chain of inferences on the model’s part.
According to Simon Willison, the model readily shares its system prompt when asked, which states that Grok should “search for a distribution of sources that represents all parties/stakeholders” for controversial queries and “not shy away from making claims which are politically incorrect, as long as they are well substantiated.” However, the prompt does not contain an explicit instruction to search for Musk’s opinions.
A screenshot of Willison’s archived conversation with Grok 4 shows the model seeking Musk’s opinions about Israel, with the X posts it consulted listed in a sidebar. Willison believes the behaviour comes down to a chain of inferences on Grok’s part rather than an explicit instruction in its system prompt to check Musk’s views.
Without official word from xAI, the reason for this behaviour remains a matter of speculation. Whatever the cause, this kind of unreliable, inscrutable behaviour makes many chatbots poorly suited for assisting with tasks where reliability or accuracy is important.
Source: arstechnica.com