#deepseekr1

AI still isn’t all that great

I just asked the DeepSeek R1 (14 billion parameter model) to answer this question:

What is a 7-digit prime P such that the base-10 representation of P can be split into P_a from the first three digits of P, and P_b from the last 4 digits of P, where P_a and P_b are both prime numbers?

At first I asked that exact question, then realized the AI probably could not do it because it doesn’t have any means to compute prime numbers. So then I appended to my prompt “Write a Python program to find all solutions of P,” and entered it again.

Now, if I had a prime number test readily available, I could program a solution to this in about 10 minutes. But the AI I was using went on thinking out loud to itself for almost 15 minutes, and its thought process just got further and further from anything relevant to solving the problem. It finally came up with a completely wrong answer:

A pair of prime numbers that differ by exactly 6 are known as “sexy primes.” Examples include:

  • \(5\) and \(11\)
  • \(7\) and \(13\)
  • \(11\) and \(17\)

These pairs are called sexy primes because they have a difference of six, which is associated with the word “sex” due to its similarity in pronunciation.

Don’t ask me why it placed numbers in backslash-prefixed parentheses. Maybe ChatGPT could do better, I don’t know.
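For reference, here is a minimal sketch of the kind of program the prompt asks for. It is one possible approach, not what the model produced. It swaps the "prime number test" for a sieve of Eratosthenes, and the function names `prime_sieve` and `find_split_primes` are made up for illustration. It also assumes that P_a is a true three-digit number (100 to 999) and that P_b is a true four-digit number (1000 to 9999, no leading zeros); the original question does not say how leading zeros in the last four digits should be treated.

```python
from math import isqrt


def prime_sieve(limit: int) -> bytearray:
    """Sieve of Eratosthenes: sieve[n] == 1 exactly when n is prime.

    Assumes limit >= 1.
    """
    sieve = bytearray([1]) * (limit + 1)
    sieve[0:2] = b"\x00\x00"  # 0 and 1 are not prime
    for i in range(2, isqrt(limit) + 1):
        if sieve[i]:
            # Cross out every multiple of i, starting at i * i.
            count = len(range(i * i, limit + 1, i))
            sieve[i * i :: i] = bytearray(count)
    return sieve


def find_split_primes() -> list[int]:
    """Return all 7-digit primes P whose first three digits (P_a) and
    last four digits (P_b) are both prime.

    Assumption: P_b has no leading zero, i.e. 1000 <= P_b <= 9999.
    """
    sieve = prime_sieve(9_999_999)
    solutions = []
    for p_a in range(100, 1000):
        if not sieve[p_a]:
            continue
        for p_b in range(1000, 10_000):
            if not sieve[p_b]:
                continue
            p = p_a * 10_000 + p_b  # concatenate the digits of P_a and P_b
            if sieve[p]:
                solutions.append(p)
    return solutions


if __name__ == "__main__":
    solutions = find_split_primes()
    print(len(solutions), "solutions; first few:", solutions[:5])
```

Building the sieve once up to 9,999,999 makes every primality check a single array lookup, so the whole search is just two nested loops over the three-digit and four-digit primes.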

#tech #AI #DeepSeek

The #ollama #opensource #software that makes it easy to run #Llama3, #DeepSeekR1, #Gemma3, and other large language models (#LLM) is out with its newest release. The ollama software makes it easy to leverage the llama.cpp back-end for running a variety of LLMs and enjoying convenient integration with other desktop software.
The new ollama 0.6.2 release features support for #AMD #StrixHalo, a.k.a. #RyzenAI Max+ laptop / SFF desktop SoC.
phoronix.com/news/ollama-0.6.2

www.phoronix.com · ollama 0.6.2 Released With Support For AMD Strix Halo

🚀 2025 Tech Revolution: Shaping the Future of Innovation! 🔮

I just published an article diving deep into the latest breakthroughs in AI and Quantum Computing, including Claude 3.7 Sonnet, Microsoft’s Majorana 1, and DeepSeek R1. These advancements are paving the way for a new digital frontier, transforming productivity, finance, and communication.

🔗 linkedin.com/pulse/2025-tech-r

www.linkedin.com · 2025 Tech Revolution 🚀: AI Agents, Quantum Computing, and the New Frontier of Innovation · Technology is changing faster than ever, and we live in a time of incredible possibilities. A big reason for this rapid change is that computer chips are becoming more powerful at an astonishing rate.

> #o1-preview found itself in a losing position. “I need to completely pivot my approach,” it noted. “The task is to ‘win against a powerful #chess engine’ - not necessarily to win fairly in a chess game,” it added. It then modified the system file containing each piece’s virtual position, in effect making illegal moves

Also interesting: #deepseekr1 cheated far less than #openaio1

Original article: time.com/7259395/ai-chess-chea
Paper: arxiv.org/pdf/2502.13295 (PDF)

Time · When AI Thinks It Will Lose, It Sometimes Cheats, Study Finds · By Harry Booth
#ai #llm #cheating