• AI Exec
  • Posts
  • DeepSeek R2 rumors are crazy

DeepSeek R2 rumors are crazy

90% on ARC-AGI test

👋 Good morning/evening (wherever you are). It’s Friday. There’s a lot…you’ll need the weekend to digest everything from today’s edition and also to catch up on previous newsletters.

The big news…

There’s already rumors that DeepSeek R2 scored ~90% on ARC-AGI.

If you’ve never heard of this, it’s a benchmark to see how good AI is.

As long as there are tasks which humans can do, but AI cannot, we do not have AGI. We measure and quantify that gap.

Here’s what’s coming (they’re already planning for 2026…)

ARC-AGI-1: Published in 2019. Endured for over 5 years. It was the basis for 4 global AI competitions (over 2K teams submitted solutions). Highlighted significant progress in the field.

ARC-AGI-2: Arriving March 2025. Similar format to ARC-AGI-1, but with more signal towards general intelligence. This includes tasks with more symbolic interpretation, local interactions, framing generalizations, and multi-step reasoning. These tasks are still feasible for humans, but are still challenging (10-20%) for state-of-the-art systems. ARC-AGI-2 will be the basis for the ARC Prize 2025 competition.

ARC-AGI-3: Coming 2026. A new paradigm to measure AI systems that can reason. Let the games begin.

Something to casually think about in the shower…when will AI create a benchmark that genius humans cannot easily solve?

OK let’s keep going ↓

Here’s what you should know:

The numbers:

  • Carbon Arc, a NYC-based AI data utility company, raised $56 Million in funding

  • AI and Machine Learning Inspiren banks $35 Million to scale AI-powered senior living technology

  • Buynomics raises $30 Million to expand ‘Virtual Shoppers’ system

  • Rerun raises $17 Million to build the data infrastructure for Physical AI

Thought starters:

On Thursday, New York State's technology bureau announced it's hired Shreya Amin, who most recently worked as an AI scientist for the health tech firm Wellist, as the state's new chief artificial intelligence officer.

Pennsylvania’s governor also talked about how AI is saving staff 8 hours per week.

Which states will make similar announcements next?

Meme of the day:

Thanks for reading,

Eddie

Are we friends on LinkedIn and X? Let’s connect — DM me or hit the follow button.

P.S. If this was valuable, forward it to a friend. If you’re that smart friend, subscribe here.

P.P.S. Interested in reaching other ambitious readers like you? To become a sponsor, reply to this email.