mstdn.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A general-purpose Mastodon server with a 500 character limit. All languages are welcome.

Administered by:

Server stats:

16K
active users

#h100

0 posts0 participants0 posts today

Sizing up #MI300A’s #GPU
It’s well ahead of #Nvidia’s #H100 PCIe for just about every major category of 32- or 64-bit operations. MI300A can achieve 113.2 TFLOPS of #FP32 throughput, with each FMA counting as two floating point operations. For comparison, H100 PCIe achieved 49.3 TFLOPS in same test.
#AMD cut down #MI300X’s GPU to create MI300A. 24 #Zen4 cores is a lot of #CPU power, and occupies one quadrant on the MI300 chip. But MI300’s main attraction is still the GPU.
chipsandcheese.com/p/sizing-up

Chips and Cheese · Sizing up MI300A’s GPUBy Chester Lam

#Huawei #HiSilicon #Ascend 910C is a version of the company's Ascend 910 processor for #AI training introduced in 2019. By now, the performance of the Ascend 910 is barely sufficient for the cost-efficient training of large AI models. Still, when it comes to inference, it delivers 60% of #Nvidia #H100 performance, according to researchers from #DeepSeek While the Ascend 910C is not a performance champion, it can succeed in reducing China's reliance on Nvidia #GPU's tomshardware.com/tech-industry

Tom's Hardware · DeepSeek research suggests Huawei's Ascend 910C delivers 60% of Nvidia H100 inference performanceBy Anton Shilov

Okay, loosing my mind here a bit. I just tested #OpenGL rendering under Linux on an #NVIDIA #H100 GPU, through #VirtualGL's #EGL backend.

And it worked... Renderer "NVIDIA H100/PCIe/SSE2", driver 555.42.06

I always understood the H100s to be incapable of OpenGL. But it seems I missed a crucial part in the H100 architecture doc (resources.nvidia.com/en-us-ten), shown in the image.

Except, I'm sure I tested OpenGL at some point under X, but it didn't work. So, did anything change (e.g. driver)?

Replied in thread

@PWS_1
Zur Zeit weiss noch niemand zu sagen, was hinter der sogenannten Kosteneffizienz von #DeepSeek steckt.
Bessere #Algorithmen? Adequatere chinesisch geframte #Trainingsdaten? Weit mehr Trainingskapazität qua mehr chinesischen Sklavenarbeitern (vgl. reverse lookup auf analogen Telefon&Adressdaten qua Transkriptionssklaven)? Oder hat #China tatsächlich doch Zugang zu genügend stromfressenden #H100-Ressourcen?
Ja, die Benchmarktests zu DeepSeek sind remarkable. #AI
m.youtube.com/watch?v=FJvSFTMN

m.youtube.com- YouTubeEnjoy the videos and music you love, upload original content, and share it all with friends, family, and the world on YouTube.

Breakthrough in AI-Powered #Audio Generation and Transformation 🎵

🎹 #Fugatto, developed by #NVIDIA researchers, introduces universal sound manipulation through text prompts, handling music, voice & sound effects simultaneously

🎯 Advanced capabilities include accent modification, emotion control, and creation of never-before-heard sounds using #AI technology

🔧 Technical specs: 2.5B parameters, trained on #DGX systems with 32 #H100 GPUs, featuring ComposableART for instruction combination

🎨 Applications span #music production, game development, advertising & language learning - enables real-time audio asset generation & modification

💡 Developed by international team from India, Brazil, China, Jordan & South Korea, enhancing multi-accent & multilingual capabilities

#ai #ML

blogs.nvidia.com/blog/fugatto-

📺 youtu.be/qj1Sp8He6e4

NVIDIA Blog · Now Hear This: World’s Most Flexible Sound Machine DebutsFugatto generates or transforms any mix of music, voices and sounds described with prompts using any combination of text and audio files.