I recently found on Cornell's #arXiv a new preprint (2023) on #RNN and #LSTM by Alex Sherstinsky of MIT. Through the years, I've read numerous papers on RNNs, starting with Rumelhart's 1986 paper. But this one is, by far, the most detailed tutorial not only on RNNs but also on LSTMs.
The complete derivations of both the forward (inference) and backward (training) passes of the learning algorithm use only basic calculus and matrix algebra, drawing intuitive analogies to digital signal processing #DSP. The equations are complete and detailed enough for a student to implement directly in software. In my opinion, every EE and CS undergrad studying #DeepLearning #NeuralNetworks should read this superb introduction.
https://arxiv.org/pdf/1808.03314.pdf
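
For a taste of what the paper derives, here's a minimal sketch of the canonical RNN forward pass, the recurrence h_t = tanh(Wxh x_t + Whh h_{t-1} + b). Variable names and dimensions here are my own illustrative choices, not the paper's notation:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative dimensions: input size, hidden size, sequence length
d_x, d_h, T = 4, 8, 5

# Randomly initialized parameters (small scale, just for demonstration)
Wxh = rng.normal(scale=0.1, size=(d_h, d_x))  # input-to-hidden weights
Whh = rng.normal(scale=0.1, size=(d_h, d_h))  # hidden-to-hidden weights
b = np.zeros(d_h)                             # hidden bias

def rnn_forward(xs, h0):
    """Apply the vanilla RNN recurrence h_t = tanh(Wxh x_t + Whh h_{t-1} + b)
    over a sequence, returning the hidden state at each step."""
    h = h0
    hs = []
    for x in xs:
        h = np.tanh(Wxh @ x + Whh @ h + b)
        hs.append(h)
    return hs

xs = rng.normal(size=(T, d_x))        # a toy input sequence
hs = rnn_forward(xs, np.zeros(d_h))   # forward (inference) pass
print(hs[-1].shape)                   # (8,) -- final hidden state
```

The backward (training) pass the paper derives is backpropagation through this same unrolled loop; the full derivation, including the LSTM gating equations, is in the paper itself.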