New Study Reveals AI Struggles with Reading Analogue Clocks and Calendar Date Calculations

New research has identified a set of tasks that humans handle effortlessly but artificial intelligence (AI) struggles with—specifically reading analogue clocks and determining the day of the week for a given date. Although AI can generate code, images, human-like text, and even pass exams to varying degrees, it frequently misinterprets clock hand positions and fails basic calendar arithmetic. Presented at the 2025 International Conference on Learning Representations (ICLR) and published on the preprint server arXiv (not yet peer-reviewed), the study highlights significant gaps in AI’s ability to perform tasks humans master early in life. Lead author Rohit Saxena of the University of Edinburgh emphasized that these shortcomings must be addressed for AI to be effectively applied in time-sensitive and real-world contexts like scheduling, automation, and assistive technologies. The researchers tested various multimodal large language models (MLLMs)—including Meta’s Llama 3. 2-Vision, Anthropic’s Claude-3. 5 Sonnet, Google’s Gemini 2. 0, and OpenAI’s GPT-4o—using a custom dataset of clock and calendar images. The models failed to correctly identify clock times or determine weekdays for sample dates over half the time, with accuracy rates of only 38. 7% for clocks and 26. 3% for calendar tasks. Saxena explained that AI’s poor clock-reading stems from its lack of spatial reasoning—tasks requiring detection of overlapping hands, angle measurements, and interpreting diverse clock designs, such as Roman numerals or stylized dials. Recognizing an image as a clock is easier for AI than reading it accurately.
Similarly, despite arithmetic being fundamental to computing, large language models do not perform calculations via algorithms; instead, they predict outputs based on training data patterns. This leads to inconsistent and non-rule-based reasoning, explaining high failure rates on date-related arithmetic. This study adds to growing evidence that AI’s mode of “understanding” differs fundamentally from human cognition. AI excels when abundant training examples exist but struggles with abstract reasoning and generalization, especially on tasks mixing perception with precise logic. Furthermore, limited training data on rarer phenomena like leap years hampers performance, as AI fails to make necessary conceptual connections. The findings underscore the need for richer, targeted datasets and reevaluation of AI’s capability to integrate logical and spatial reasoning, highlighting risks of over-reliance on AI outputs in complex tasks. Saxena stressed the necessity of rigorous testing, fallback mechanisms, and often human oversight when AI is tasked with combining perception and exact reasoning.
Brief news summary
New research presented at the 2025 International Conference on Learning Representations highlights significant limitations in current AI models like Meta’s Llama 3.2-Vision, Anthropic’s Claude-3.5 Sonnet, Google’s Gemini 2.0, and OpenAI’s GPT-4o. Despite recent advancements, these models struggle with tasks that are straightforward for humans, such as reading analog clocks and determining weekdays from dates. The study found these models correctly interpreted clock times only 38.7% of the time and calendar dates just 26.3%, underscoring their reliance on pattern recognition rather than genuine reasoning abilities. Led by Rohit Saxena from the University of Edinburgh, the research reveals that while AI systems can identify objects accurately, they face notable challenges with complex spatial and logical reasoning tasks, especially involving uncommon events like leap years. The findings emphasize the need for new training approaches that integrate logical and spatial reasoning skills and warn against overdependence on AI for tasks requiring precise calculations. Ultimately, the study highlights fundamental differences between human cognition and AI pattern matching, advocating for comprehensive validation and human oversight in time-sensitive real-world applications.
AI-powered Lead Generation in Social Media
and Search Engines
Let AI take control and automatically generate leads for you!

I'm your Content Manager, ready to handle your first test assignment
Learn how AI can help your business.
Let’s talk!

Nvidia gets an AI boost, Meta hits a AI roadblock…
The next battleground in the AI arms race isn’t Beijing—it’s Riyadh, at least according to Wedbush.

The Public internet is a bottleneck for blockchai…
According to Austin Federa, co-founder and CEO of DoubleZero—a project focused on developing high-speed fiber optic communication rails for blockchains—the public internet infrastructure is the primary speed and performance bottleneck for high-throughput blockchain networks.

Shoosmiths Incentivizes AI Adoption with £1 Milli…
At the start of last month, Shoosmiths, a British law firm with 1,500 employees, announced a £1 million bonus pool to be shared among staff if they collectively adopted Microsoft’s AI tool, Copilot, in their workflows.

JP Morgan Settles First Tokenized Treasury Transa…
JP Morgan has completed its inaugural transaction on a public blockchain, signaling the financial giant’s growing engagement with the Web3 ecosystem.

AI chips are the new 'coin of the realm' as they …
© 2025 Fortune Media IP Limited.

Central Banks Explore Blockchain to Modernize Mon…
Central banks are starting to investigate how programmable blockchain technologies could transform the implementation of monetary policy.

Star Wars' Showcase of AI Special Effects Was a C…
If Disney leadership has its way, we’ll be inundated with endless Star Wars reboots, sequels, and spinoffs until the Sun eventually explodes.