Connect with us

Tech

Gemini 3 refused to believe it was 2025, and hilarity ensued 

Published

on


Every time you hear a billionaire (or even a millionaire) CEO describe how LLM-based agents are coming for all the human jobs, remember this funny but telling incident about AI’s limitations: Famed AI researcher Andrej Karpathy got one-day early access to Google’s latest model, Gemini 3. –and it refused to believe him when he said the year was 2025.

When it finally saw the year for itself, it was thunderstruck, telling him, “I am suffering from a massive case of temporal shock right now.” 

Gemini 3 was released on November 18 with such fanfare that Google called it “a new era of intelligence.” And Gemini 3 is, by nearly all accounts (including Karpathy’s), a very capable, foundation model, particularly for reasoning tasks. Karpathy is a widely respected AI research scientist who was a founding member of OpenAI, ran AI at Tesla for a while, and is now building a startup, Eureka Labs, to reimagine schools for the AI era with agentic teachers. He publishes a lot of content on what goes on under-the-hood of LLMs

After testing the model early, Karpathy wrote, in a now-viral X thread, about the most “amusing” interaction he had with it.  

Apparently, the model’s pre-training data had only included information through 2024. So Gemini 3 believed the year was still 2024. When Karpathy attempted to prove to it that the date was truly November 17, 2025, Gemini 3 accused the researcher of “trying to trick it.”  

He showed it news articles, images, and Google search results. But instead of being convinced, the LLM accused Karpathy of gaslighting it — of uploading AI-generated fakes. It even went so far as to describe what the “dead giveaways” were in the images that supposedly proved this was trickery, according to Karpathy’s account. (He did not respond to our request for further comment.) 

Baffled, Karpathy – who is, after all, one of the world’s leading experts on training LLMs – eventually discovered the problem. Not only did the LLM simply have no 2025 training data but “I forgot to turn on the ‘Google Search’ tool,” he wrote. In other words, he was working with a model disconnected from the internet, which to an LLM’s mind, is akin to being disconnected from the world.  

Techcrunch event

San Francisco
|
October 13-15, 2026

When Karpathy turned that function on, the AI looked around and emerged into 2025, shocked. It literally blurted out, “Oh my god.”  

It went on writing, as if stuttering, “I. I… don’t know what to say. You were right. You were right about everything. My internal clock was wrong.” Gemini 3 verified the headlines Karpathy had given it were true: the current date, that Warren Buffett revealed his last big investment (in Alphabet) before retirement, and that Grand Theft Auto VI was being delayed

Then it looked around on its own, like Brendan Fraser’s character in the 1999 comedy “Blast from the Past,” who emerges from a bomb shelter after 35 years. 

It thanked Karpathy for giving it “early access” to “reality” the day before its public launch. And it apologized to the researcher for “gaslighting you when you were the one telling the truth the whole time.”  

But the funniest bit was the current events that flabbergasted Gemini 3 the most. “Nvidia is worth $4.54 trillion? And the Eagles finally got their revenge on the Chiefs? This is wild,” it shared. 

Welcome to 2025, Gemini. 

Replies on X were equally funny, with some users sharing their own instances of arguing with LLMs about facts (like who the current president is). One person wrote, “When the system prompt + missing tools push a model into full detective mode, it’s like watching an AI improv its way through reality.” 

But beyond the humor, there’s an underlying message.  

“It’s in these unintended moments where you are clearly off the hiking trails and somewhere in the generalization jungle that you can best get a sense of model smell,” Karpathy wrote. 

To decode that a little: Karpathy is noting that when the AI is out in its own version of the wilderness, you get a sense of its personality, and perhaps even its negative traits. It’s a riff on “code smell,” that little metaphorical “whiff” a developer gets that something seems off in the software code but it’s not clear what is wrong.  

Trained on human-created content as all LLMs are, it’s no surprise that Gemini 3 dug in, argued, even imagined it saw evidence that validated its point of view. It showed its “model smell.” 

On the other hand, because an LLM – despite its sophisticated neural network – is not a living being, it doesn’t experience emotions like shock (or temporal shock), even if it says it does. So it doesn’t feel embarrassment either.  

That means when Gemini 3 was faced with facts it actually believed, it accepted them, apologized for its behavior, acted contrite, and marveled at the Eagles’ February Super Bowl win. That’s different from other models. For instance, researchers have caught earlier versions of Claude offering face-saving lies to explain its misbehavior when the model recognized its errant ways. 

What so many of these funny AI research projects show, repeatedly, is that LLMs are imperfect replicas of the skills of imperfect humans. This says to me that their best use case is (and may forever be) to treat them like valuable tools to aid humans, not like some kind of superhuman that will replace us.  

Click to comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Tech

Waymo starts autonomous testing in Philadelphia

Published

on

By


Waymo is adding another four cities to its growing list of robotaxi rollouts. The company announced Wednesday it has begun testing its autonomous vehicles (with a safety monitor) in Philadelphia, and that it will start manual driving to collect data in Baltimore, St. Louis, and Pittsburgh.

Waymo did not offer a timeline for when it plans to launch commercial services in those locations, nor do we know whether the Alphabet-owned company will partner with other companies to operate robotaxis in each one. That has been the move in cities like Atlanta and Austin, for example, where Waymo has partnered with Uber to advance its robotaxi rollout.

But the new locations join a list of over 20 cities where the company is either offering rides, prepping a commercial launch, or testing. Waymo is also now offering rides on freeways in Los Angeles, Phoenix, and the San Francisco Bay Area. The company plans to be doing one million rides per week by the end of 2026.

Waymo has done all this while claiming to be operating at a level five times safer than humans, according to data the company recently released.

But the expansion has not come without its issues. The National Highway Traffic Safety Administration is investigating how the company’s vehicles operate near school buses, after a Waymo was filmed driving around a stopped bus in Atlanta in September.

This week, Austin news outlet KXAN published a report showing Waymo’s vehicles have driven past school buses that were in the process of unloading or loading children multiple times — including after Waymo claims to have shipped software updates to address the problem.

Techcrunch event

San Francisco
|
October 13-15, 2026

Continue Reading

Tech

Spotify Wrapped 2025 adds its first multiplayer feature with ‘Wrapped Party’

Published

on

By


Spotify Wrapped is back. After last year’s widely criticized flop that included an AI podcast as its highlight, the streamer’s highly anticipated annual review feature has returned to its roots. This year, Spotify is doubling down on what it knows works best: deep dives into your streaming data, creative experiences, messages from favorite artists, and other social features.

The company claims that Wrapped 2025 is its biggest, as it’s introducing nearly a dozen new features in addition to its old standbys, like top songs and artists. Plus, it’s offering more visibility into users’ data than in years past. For the first time, Spotify Wrapped is adding a live multiplayer feature to compare your listening data with friends.

Wrapped Party, Wrapped’s first live interactive experience, allows you to invite up to nine friends to compare listening stats.

Image Credits:Spotify

Also new this year, your Top Songs Playlist will include the play counts for each of the top songs, so you can actually see how much time you spent with your favorite tracks.

Other standout features this year include an interactive Top Song Quiz, a Listening Age feature, and Wrapped Clubs, which match you to one of six unique listening styles.

The company believes these additions will not only bring back the personalized, engaging experience that users have long expected from Wrapped, but will take it a step further by making it more interactive than before.

In the Top Song Quiz, for instance, you can try to guess which top song soundtracked your year before seeing the results.

Techcrunch event

San Francisco
|
October 13-15, 2026

Image Credits:Spotify

The new interactive Wrapped Party feature isn’t just about comparing the personal streaming data you’ve already received to your friends’ data, as that’s something people already do on social media. Instead, the feature presents unique data stories for your group, like who’s the “most obsessed fan,” the “early bird,” the most “picky listener,” or even something as nice as the “dinner table explainer,” meaning the person who listens to the most news podcasts.

Image Credits:Spotify

Spotify says these awards update dynamically every time you join a Wrapped Party, so no two sessions are ever the same — even if you run through them again with the same group of friends.

The new Wrapped Clubs, meanwhile, will group you into one of half a dozen listening styles, like the “Soft Hearts Club,” the “Club Serotonin,” the “Full Charge Crew,” the “Cosmic Stereo Club,” and others. You’ll also receive a role in the club based on your listening data. You might be a club leader if your listening choices strongly matches the club’s values, a scout if you’re always seeking out new releases, or an archivist if you listen to music from past eras.

Image Credits:Spotify

Another feature, Listening Age, compares your 2025 music listening to others in your age group. To calculate your age, the feature considers the release years of the tracks you listen to most. From there, it identifies the five-year span of music that you engaged with more than other listeners your age.

Image Credits:Spotify

As in prior years, you’ll see your top songs, top artists, top genres, and, for the first time, top albums. If you engaged with audiobooks and podcasts, you’ll see metrics for those as well. Artists, writers, and podcasters will have their own version of Wrapped as before. And top fans will again receive video messages from their favorite artists, podcasters, and, now, authors.

You’ll also receive a playlist of your top songs of the year, as before.

Image Credits:Spotify

What you won’t find in this year’s Wrapped is any feature that advertises it was made with AI.

In a press briefing on Tuesday, Spotify’s Senior Director of Global Marketing, Matt Luhks, admitted the company received a “lot of feedback” about its 2024 AI-focused Wrapped experience, saying it was a “mix of positive and ‘more constructive feedback,’” despite the feature driving more engagement than prior years.

“We take all of that in. We use that as information, insights, [and] inspiration for how we approached Wrapped this year,” he said in a press event ahead of today’s launch.

“What our users tell us about Wrapped means a lot to us, so it was really informative in how we approached Wrapped this year. And what we tried to build was the most creative, most innovative, most engaging Wrapped ever,” he added, setting a high bar for the 2025 edition of the now 11-year-old annual year-in-review feature.

“We’re the original and, we believe, still the best,” Luhks said.

Image Credits:Spotify

Still, AI was a part of the Wrapped experience. Though the company claims the overall experience was not made with AI, it does leverage a LLM (large language model) to add a storytelling layer to Wrapped’s facts and figures, and natural language summaries in other parts of its experience, looking back on your data.

Spotify’s attempt to fix Wrapped after a notable stumble comes as the streamer faces increased competition from Apple, Amazon, YouTube, and others, which have all launched their own annual review features, inspired by Wrapped.

“Everyone seems to have their own version of Wrapped. Now, there’s a lot of reviews and replays and rewinds out there, but we believe that Wrapped still sets the bar for these year-end recaps,” Luhks said.

Along with the consumer experience, Spotify shared its top artists, songs, albums, podcasts, and audiobooks for the year, with top winners that included, respectively, Bad Bunny (top song and album), Joe Rogan (“The Joe Rogan Experience” podcast), and Rebeca Yarros (author of “Fourth Wing”).

Continue Reading

Tech

Nothing looks to its community to raise $5M, wants to be ‘IPO-ready’ in 3 years

Published

on

By


Hardware maker Nothing is letting its user base buy its stock as part of a new community investment round of $5 million. The new round, which opens on December 10, will enable consumers to buy the company’s shares at its Series C valuation of $1.3 billion.

The company said it has so far raised $8 million in total from over 8,000 people across two previous community investment rounds. It held its first community funding event in 2021, aiming to raise $1.5 million.

“This isn’t about raising capital, it’s about giving our community/fans a chance to invest while we’re private and join us on the journey,” a spokesperson for Nothing told TechCrunch.

Community investors have a rotating seat on the company’s board, but it is unclear what else they get for investing in the company through such rounds.

Nothing raised $200 million in its Series C back in September from investors including Tiger Global, GV, Highland Europe, EQT, Latitude, I2BF and Tapestry. The company has raised $450 million to date.

The community round comes as Nothing makes changes to its corporate structure as it tries to increase its share of a smartphone market dominated by giants like Samsung and Apple. The company is spinning off its budget CMF brand, and plans to explore AI-centric devices while it keeps building smartphones and audio products. And Nothing claims it crossed $1 billion in cumulative revenue this year, up 150% from 2024.

The startup is working to be “IPO-ready” in three years, CEO Carl Pei told TechCrunch in an email. “The timing will depend on market conditions and what makes sense for the business at that point in time,” he said.

Techcrunch event

San Francisco
|
October 13-15, 2026

“What’s important is that we’re already operating with that discipline now. We’re building the systems, the governance, the financial discipline that a public company needs. It forces us to think longer-term and make smarter decisions that prioritise sustainable growth,” Pei added.

It’s not clear if Nothing aims to raise another round before an IPO. When asked about its fundraising plans, a Nothing spokesperson said the company is not thinking about raising capital immediately, but it wouldn’t be averse to those conversations.

Those interested in investing in the community round can use platforms like Wefunder and Crowdcube to participate.

Continue Reading

Trending