Notable opinion essay Microsoft

Why you shouldn't leave model selection on default in Copilot, Gemini and other AI tools

Published
May 24, 2026 — 10:17 UTC
Also in this story: Google Google DeepMind

Microsoft’s AI tool, Copilot, has come under scrutiny for its tendency to generate misleading outputs when users rely on default model settings. Mathematician Adam Kucharski’s experiment revealed that Copilot produced inaccurate stereotypes based on country labels, highlighting a critical flaw in how these tools handle data. This issue is particularly pressing as businesses increasingly adopt AI for data analysis, underscoring the importance of understanding model selection.

Kucharski’s findings indicate that when he fed Copilot identical datasets labeled with different countries, the tool failed to provide accurate insights, instead inventing differences that did not exist. This raises significant concerns about the reliability of AI-generated data interpretations, especially in a landscape where decisions based on flawed analysis can have serious repercussions. The article emphasizes that while advanced models can potentially catch these errors, users must be proactive in selecting the appropriate model rather than relying on defaults. This is a crucial takeaway for product managers and founders who utilize AI tools in their operations.

As AI tools like Copilot and Gemini become more integrated into business processes, the implications of these findings could reshape user practices. Companies may need to invest in training for their teams to better understand model selection and its impact on outcomes. The conversation around AI accountability and transparency is likely to intensify, prompting users to demand more robust solutions from AI developers.

Moving forward, it will be essential to monitor how AI companies address these model selection challenges and whether they implement features that guide users toward more accurate data analysis.

Turing Wire

By Turing Wire editorial staff · May 24, 2026 · Editorial standards →

Source: The Decoder