**Benjamin Carr, Ph.D.** @BenjaminHCCarr@hachyderm.io · Feb 26

Sizing up #MI300A’s #GPU
It’s well ahead of #Nvidia’s #H100 PCIe for just about every major category of 32- or 64-bit operations. MI300A can achieve 113.2 TFLOPS of #FP32 throughput, with each FMA counting as two floating point operations. For comparison, H100 PCIe achieved 49.3 TFLOPS in the same test.
#AMD cut down #MI300X’s GPU to create MI300A. 24 #Zen4 cores is a lot of #CPU power, and occupies one quadrant on the MI300 chip. But MI300’s main attraction is still the GPU.
https://chipsandcheese.com/p/sizing-up-mi300as-gpu

Chips and Cheese · Jan 20 · Sizing up MI300A’s GPU · By Chester Lam
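
For readers who want to check the arithmetic in the post above, here is a minimal sketch. It uses only the two FP32 figures quoted in the post and the stated convention that one FMA counts as two floating point operations. The function names are hypothetical, chosen for illustration, and nothing here reproduces the Chips and Cheese benchmark itself.

```python
# Figures quoted in the post (FP32 throughput in TFLOPS).
MI300A_FP32_TFLOPS = 113.2
H100_PCIE_FP32_TFLOPS = 49.3

# Convention stated in the post: one fused multiply-add = 2 floating point ops.
FLOPS_PER_FMA = 2


def fma_rate_tera(tflops: float) -> float:
    """Convert a TFLOPS figure into trillions of FMA operations per second."""
    return tflops / FLOPS_PER_FMA


def speedup(a_tflops: float, b_tflops: float) -> float:
    """Ratio of two throughput figures measured under the same convention."""
    return a_tflops / b_tflops


if __name__ == "__main__":
    ratio = speedup(MI300A_FP32_TFLOPS, H100_PCIE_FP32_TFLOPS)
    print(f"MI300A vs H100 PCIe (FP32): {ratio:.2f}x")
    print(f"MI300A FMA rate: {fma_rate_tera(MI300A_FP32_TFLOPS):.1f} tera-FMA/s")
    print(f"H100 PCIe FMA rate: {fma_rate_tera(H100_PCIE_FP32_TFLOPS):.2f} tera-FMA/s")
```

On the quoted numbers, the ratio works out to roughly 2.3x in MI300A's favor. Because both figures use the same FMA convention, the ratio does not change if FMAs are counted as one operation instead of two.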

**Dr. Moritz Lehmann** @ProjectPhysX@mast.hpc.social · Feb 23

**st1nger** @st1nger@infosec.exchange · Feb 5

#Huawei #HiSilicon #Ascend 910C is a version of the company's Ascend 910 processor for #AI training, which was introduced in 2019. By now, the performance of the Ascend 910 is barely sufficient for the cost-efficient training of large AI models. Still, when it comes to inference, the 910C delivers 60% of #Nvidia #H100 performance, according to researchers from #DeepSeek. While the Ascend 910C is not a performance champion, it can succeed in reducing China's reliance on Nvidia #GPUs. https://www.tomshardware.com/tech-industry/artificial-intelligence/deepseek-research-suggests-huaweis-ascend-910c-delivers-60-percent-nvidia-h100-inference-performance

Tom's Hardware · Feb 4 · DeepSeek research suggests Huawei's Ascend 910C delivers 60% of Nvidia H100 inference performance · By Anton Shilov

**Paul Melis** @paulmelis@mastodon.gamedev.place · Feb 5 *

Okay, losing my mind here a bit. I just tested #OpenGL rendering under Linux on an #NVIDIA #H100 GPU, through #VirtualGL's #EGL backend.

And it worked... Renderer "NVIDIA H100/PCIe/SSE2", driver 555.42.06

I always understood the H100s to be incapable of OpenGL. But it seems I missed a crucial part in the H100 architecture doc (https://resources.nvidia.com/en-us-tensor-core), shown in the image.

Except, I'm sure I tested OpenGL at some point under X, but it didn't work. So, did anything change (e.g. driver)?

**Frankie** @Some_Emo_Chick@mastodon.social · Feb 1

**eicker.news tech news** @technews@eicker.news · Jan 31

**Pyrzout** @jos1264@social.skynetcloud.site · Jan 29

Replied in thread

**Johannes Reetz** @JohannesReetz@gruene.social · Jan 28 *

@PWS_1
At the moment, nobody can say what is behind the so-called cost efficiency of #DeepSeek.
Better #algorithms? More suitable, Chinese-framed #trainingdata? Far more training capacity thanks to more Chinese slave laborers (cf. reverse lookup on analog telephone and address data by transcription slaves)? Or does #China actually have access to enough power-hungry #H100 resources after all?
Yes, the benchmark tests for DeepSeek are remarkable. #AI
https://m.youtube.com/watch?v=FJvSFTMNTu4

**JimBSR** @jimlee@sfba.social · Dec 28, 2024

**michabbb** @michabbb@vivaldi.net · Nov 26, 2024

Breakthrough in AI-Powered #Audio Generation and Transformation

#Fugatto, developed by #NVIDIA researchers, introduces universal sound manipulation through text prompts, handling music, voice & sound effects simultaneously

Advanced capabilities include accent modification, emotion control, and creation of never-before-heard sounds using #AI technology

Technical specs: 2.5B parameters, trained on #DGX systems with 32 #H100 GPUs, featuring ComposableART for instruction combination

Applications span #music production, game development, advertising & language learning - enables real-time audio asset generation & modification

Developed by international team from India, Brazil, China, Jordan & South Korea, enhancing multi-accent & multilingual capabilities

#ai #ML

https://blogs.nvidia.com/blog/fugatto-gen-ai-sound-model/

https://youtu.be/qj1Sp8He6e4

NVIDIA Blog · Nov 25, 2024 · Now Hear This: World’s Most Flexible Sound Machine Debuts · Fugatto generates or transforms any mix of music, voices and sounds described with prompts using any combination of text and audio files.

**Websplaining** @Websplaining@mastodon.online · Nov 19, 2024

**Websplaining** @Websplaining@mastodon.online · Nov 19, 2024

**eicker.news tech news** @technews@eicker.news · Nov 1, 2024

**HPC Guru** @HPC_Guru@mastodon.social · Oct 30, 2024

**Torsten** @tor@norden.social · Oct 28, 2024 *

**HPC Guru** @HPC_Guru@mastodon.social · Oct 28, 2024

**HPC Guru** @HPC_Guru@mastodon.social · Oct 23, 2024

**Byzzyy** @Byzzyy@chaos.social · Oct 20, 2024

**Judith van Stegeren** @jd7h@fosstodon.org · Oct 18, 2024

"TLDR: Don’t buy H100s. The market has flipped from shortage ($8/hr) to oversupplied ($2/hr), because of reserved compute resales, open model finetuning, and decline in new foundation model co’s. Rent instead."

https://www.latent.space/p/gpu-bubble

Latent Space · Oct 11, 2024 · $2 H100s: How the GPU Bubble Burst · By Eugene Cheah

#llms #h100 #nvidia
