mstdn.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A general-purpose Mastodon server with a 500 character limit. All languages are welcome.

#generalization

Hacker News<p>Just Ask for Generalization</p><p><a href="https://evjang.com/2021/10/23/generalization.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">evjang.com/2021/10/23/generali</span><span class="invisible">zation.html</span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/Just" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Just</span></a> <a href="https://mastodon.social/tags/Ask" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Ask</span></a> <a href="https://mastodon.social/tags/for" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>for</span></a> <a href="https://mastodon.social/tags/Generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Generalization</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/Generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Generalization</span></a> <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://mastodon.social/tags/DataScience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataScience</span></a> <a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a></p>
nf-core<p>Pipeline release! nf-core/drugresponseeval v1.1.0 - Drugresponseeval 1.1.0 - Humongous Zapdos!</p><p>Please see the changelog: <a href="https://github.com/nf-core/drugresponseeval/releases/tag/1.1.0" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/nf-core/drugrespons</span><span class="invisible">eeval/releases/tag/1.1.0</span></a></p><p><a href="https://mstdn.science/tags/celllines" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>celllines</span></a> <a href="https://mstdn.science/tags/crossvalidation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crossvalidation</span></a> <a href="https://mstdn.science/tags/deeplearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>deeplearning</span></a> <a href="https://mstdn.science/tags/drugresponse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>drugresponse</span></a> <a href="https://mstdn.science/tags/drugresponseprediction" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>drugresponseprediction</span></a> <a href="https://mstdn.science/tags/drugs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>drugs</span></a> <a href="https://mstdn.science/tags/fairprinciples" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>fairprinciples</span></a> <a href="https://mstdn.science/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a> <a href="https://mstdn.science/tags/hyperparametertuning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>hyperparametertuning</span></a> <a href="https://mstdn.science/tags/machinelearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinelearning</span></a> <a href="https://mstdn.science/tags/randomizationtests" class="mention hashtag" 
rel="nofollow noopener" target="_blank">#<span>randomizationtests</span></a> <a href="https://mstdn.science/tags/robustnessassessment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>robustnessassessment</span></a> <a href="https://mstdn.science/tags/training" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>training</span></a> <a href="https://mstdn.science/tags/nfcore" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nfcore</span></a> <a href="https://mstdn.science/tags/openscience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openscience</span></a> <a href="https://mstdn.science/tags/nextflow" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nextflow</span></a> <a href="https://mstdn.science/tags/bioinformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bioinformatics</span></a></p>
JMLR<p>'Random Pruning Over-parameterized Neural Networks Can Improve Generalization: A Training Dynamics Analysis', by Hongru Yang, Yingbin Liang, Xiaojie Guo, Lingfei Wu, Zhangyang Wang.</p><p><a href="http://jmlr.org/papers/v26/23-0832.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">http://</span><span class="ellipsis">jmlr.org/papers/v26/23-0832.ht</span><span class="invisible">ml</span></a> <br> <br><a href="https://sigmoid.social/tags/pruning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>pruning</span></a> <a href="https://sigmoid.social/tags/pruned" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>pruned</span></a> <a href="https://sigmoid.social/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a></p>
PLOS Biology<p>Humans can apply solutions of past problems to new problems. <span class="h-card" translate="no"><a href="https://fediscience.org/@gershbrain" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>gershbrain</span></a></span> <span class="h-card" translate="no"><a href="https://mastodon.online/@nicoschuck" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>nicoschuck</span></a></span> &amp;co reveal the neural correlates of <a href="https://fediscience.org/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a> and show that humans apply past policies in a reward-sensitive manner that leads to high performance <span class="h-card" translate="no"><a href="https://fediscience.org/@PLOSBiology" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>PLOSBiology</span></a></span> <a href="https://plos.io/3SJPMof" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">plos.io/3SJPMof</span><span class="invisible"></span></a></p>
Hacker News<p>π0.5: A VLA with open-world generalization</p><p><a href="https://pi.website/blog/pi05" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">pi.website/blog/pi05</span><span class="invisible"></span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/%CF%800" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>π0</span></a>.5 <a href="https://mastodon.social/tags/VLA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VLA</span></a> <a href="https://mastodon.social/tags/openworld" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openworld</span></a> <a href="https://mastodon.social/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a> <a href="https://mastodon.social/tags/machinelearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinelearning</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a></p>
Games at Work dot biz<p>e509 — Maverick and&nbsp;Marbles</p><p>e509 with Michael and Michael - stories and discussion all around <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a>, <a href="https://mastodon.social/tags/LLMs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLMs</span></a>, <a href="https://mastodon.social/tags/llamas" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>llamas</span></a>, generated <a href="https://mastodon.social/tags/Quake" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Quake</span></a>, <a href="https://mastodon.social/tags/grokking" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>grokking</span></a>, <a href="https://mastodon.social/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a> and much more.</p><p><a href="https://gamesatwork.biz/2025/04/14/e509-maverick-and-marbles/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">gamesatwork.biz/2025/04/14/e50</span><span class="invisible">9-maverick-and-marbles/</span></a></p>
nf-core<p>Pipeline release! nf-core/drugresponseeval v1.0.0 - 1.0.0!</p><p>Please see the changelog: <a href="https://github.com/nf-core/drugresponseeval/releases/tag/1.0.0" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/nf-core/drugrespons</span><span class="invisible">eeval/releases/tag/1.0.0</span></a></p><p><a href="https://mstdn.science/tags/celllines" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>celllines</span></a> <a href="https://mstdn.science/tags/crossvalidation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crossvalidation</span></a> <a href="https://mstdn.science/tags/deeplearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>deeplearning</span></a> <a href="https://mstdn.science/tags/drugresponse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>drugresponse</span></a> <a href="https://mstdn.science/tags/drugresponseprediction" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>drugresponseprediction</span></a> <a href="https://mstdn.science/tags/drugs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>drugs</span></a> <a href="https://mstdn.science/tags/fairprinciples" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>fairprinciples</span></a> <a href="https://mstdn.science/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a> <a href="https://mstdn.science/tags/hyperparametertuning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>hyperparametertuning</span></a> <a href="https://mstdn.science/tags/machinelearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinelearning</span></a> <a href="https://mstdn.science/tags/randomizationtests" class="mention hashtag" rel="nofollow noopener" 
target="_blank">#<span>randomizationtests</span></a> <a href="https://mstdn.science/tags/robustnessassessment" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>robustnessassessment</span></a> <a href="https://mstdn.science/tags/training" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>training</span></a> <a href="https://mstdn.science/tags/nfcore" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nfcore</span></a> <a href="https://mstdn.science/tags/openscience" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openscience</span></a> <a href="https://mstdn.science/tags/nextflow" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nextflow</span></a> <a href="https://mstdn.science/tags/bioinformatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bioinformatics</span></a></p>
Sean Murthy<p>People value us for the value (they believe) we (might) add to them. </p><p>Generalizing of course, but it's all transactional. There's no (longer) valuing people for just who they are.</p><p><a href="https://hachyderm.io/tags/society" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>society</span></a> <a href="https://hachyderm.io/tags/people" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>people</span></a> <a href="https://hachyderm.io/tags/life" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>life</span></a> <a href="https://hachyderm.io/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a></p>
Victoria Stuart 🇨🇦 🏳️‍⚧️<p>Grokking at Edge of Numerical Stability<br><a href="https://arxiv.org/abs/2501.04697" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">arxiv.org/abs/2501.04697</span><span class="invisible"></span></a><br><a href="https://old.reddit.com/r/MachineLearning/comments/1i34keg/grokking_at_the_edge_of_numerical_stability" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">old.reddit.com/r/MachineLearni</span><span class="invisible">ng/comments/1i34keg/grokking_at_the_edge_of_numerical_stability</span></a><br><a href="https://en.wikipedia.org/wiki/Grokking_(machine_learning)" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">en.wikipedia.org/wiki/Grokking</span><span class="invisible">_(machine_learning)</span></a></p><p>* sudden generalization after prolonged overfitting<br>* a massively overtrained neural network can acquire "emergent", unexpectedly strong abilities<br>* an unexpected, accidental finding<br>* the underlying mechanisms are starting to be unraveled</p><p>Grokked Transformers are Implicit Reasoners: Mechanistic Journey to Edge of Generalization<br><a href="https://arxiv.org/abs/2405.15071" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">arxiv.org/abs/2405.15071</span><span class="invisible"></span></a><br><a href="https://news.ycombinator.com/item?id=40495149" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">news.ycombinator.com/item?id=4</span><span class="invisible">0495149</span></a></p><p><a href="https://mastodon.social/tags/LLM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLM</span></a> <a href="https://mastodon.social/tags/ML" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ML</span></a> <a href="https://mastodon.social/tags/grokking" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>grokking</span></a> <a href="https://mastodon.social/tags/NN" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>NN</span></a> <a href="https://mastodon.social/tags/emergence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>emergence</span></a> <a href="https://mastodon.social/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a></p>
Different Than<p>A post from August 2024 by <span class="h-card" translate="no"><a href="https://mastodon.social/@grimalkina" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>grimalkina</span></a></span>, boosted by someone on another instance, about why to report demographics in research even when you're not studying those groups. This seems like a great primer for people who have little background in basic <a href="https://infosec.exchange/tags/sampling" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>sampling</span></a> and <a href="https://infosec.exchange/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a> (for some reason I can't link/boost from here, so):</p><p><a href="https://mastodon.social/@grimalkina/112966685297897685" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">mastodon.social/@grimalkina/11</span><span class="invisible">2966685297897685</span></a></p><p>My 2 cents (already at least partially covered by Dr. Hicks): </p><p>1. Your study is never just about your study. Good science is <a href="https://infosec.exchange/tags/open" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>open</span></a> and reusable. e.g., maybe your study on tech-enabled healthcare access isn't specifically about LGBTQ+ or Hispanic people, but what are you doing to help a researcher who comes along in 10 years? That information will change what they find and report.</p><p>2. Marginalized groups are often minorities, meaning representative probability samples (or --uncomfortable gesture-- convenience samples) for bread-and-butter research frequently have subpopulations too small for reasonable power in correlations, group differences, etc. That's just reality. 
It's also a big problem for our understanding of <a href="https://infosec.exchange/tags/marginalized" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>marginalized</span></a> + <a href="https://infosec.exchange/tags/minority" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>minority</span></a> groups. Oversampling or targeted studies of those groups are important. It's also important to have a large number of less-targeted studies with relevant information that can be synthesized later (see #1): one study with 1.3% trans participants doesn't tell us much about the trans population, but 20 studies, each of which has 1.3% trans participants, could tell us meaningful things.</p><p>3. Representation is important. My belief is that <a href="https://infosec.exchange/tags/marginalized" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>marginalized</span></a>+minoritized people need their identities and existence public and constant. In <a href="https://infosec.exchange/tags/science" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>science</span></a>, both they and other people consuming the research will benefit from being reminded that they are there, almost always, in our <a href="https://infosec.exchange/tags/research" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>research</span></a>.</p>
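The pooling argument in point 2 can be made concrete with back-of-the-envelope numbers. A minimal sketch: the 1.3% subgroup share comes from the post above, while the per-study sample size of 1,000 is an assumption for illustration.

```python
import math

# The 1.3% subgroup share is from the post; a per-study sample
# size of 1,000 is assumed here purely for illustration.
SHARE = 0.013

def subgroup_se(n_total, share=SHARE, p=0.5):
    """Worst-case standard error of a proportion estimated within
    the subgroup of one study (or a pool of comparable studies)."""
    n_sub = n_total * share
    return math.sqrt(p * (1 - p) / n_sub)

se_one_study = subgroup_se(1_000)    # ~13 subgroup participants
se_pooled = subgroup_se(20 * 1_000)  # ~260 across 20 pooled studies
```

Under these assumptions, pooling 20 comparable studies shrinks the subgroup standard error by a factor of sqrt(20), roughly 4.5x — which is why many small, well-reported samples can be synthesized into something informative.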
JMLR<p>'Generalization on the Unseen, Logic Reasoning and Degree Curriculum', by Emmanuel Abbe, Samy Bengio, Aryo Lotfi, Kevin Rizk.</p><p><a href="http://jmlr.org/papers/v25/24-0220.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">http://</span><span class="ellipsis">jmlr.org/papers/v25/24-0220.ht</span><span class="invisible">ml</span></a> <br> <br><a href="https://sigmoid.social/tags/sparse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>sparse</span></a> <a href="https://sigmoid.social/tags/learns" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>learns</span></a> <a href="https://sigmoid.social/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a></p>
JMLR<p>'Mentored Learning: Improving Generalization and Convergence of Student Learner', by Xiaofeng Cao, Yaming Guo, Heng Tao Shen, Ivor W. Tsang, James T. Kwok.</p><p><a href="http://jmlr.org/papers/v25/23-1213.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">http://</span><span class="ellipsis">jmlr.org/papers/v25/23-1213.ht</span><span class="invisible">ml</span></a> <br> <br><a href="https://sigmoid.social/tags/learners" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>learners</span></a> <a href="https://sigmoid.social/tags/learner" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>learner</span></a> <a href="https://sigmoid.social/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a></p>
JMLR<p>'Neural Networks with Sparse Activation Induced by Large Bias: Tighter Analysis with Bias-Generalized NTK', by Hongru Yang, Ziyu Jiang, Ruizhe Zhang, Yingbin Liang, Zhangyang Wang.</p><p><a href="http://jmlr.org/papers/v25/23-0831.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">http://</span><span class="ellipsis">jmlr.org/papers/v25/23-0831.ht</span><span class="invisible">ml</span></a> <br> <br><a href="https://sigmoid.social/tags/sparse" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>sparse</span></a> <a href="https://sigmoid.social/tags/gradient" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gradient</span></a> <a href="https://sigmoid.social/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a></p>
Jan Vlug<p><span class="h-card" translate="no"><a href="https://mastodon.social/@schizanon" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>schizanon</span></a></span> <span class="h-card" translate="no"><a href="https://en.osm.town/@strebski" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>strebski</span></a></span> <span class="h-card" translate="no"><a href="https://chaos.social/@fossdd" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>fossdd</span></a></span> I think <a href="https://mastodon.social/tags/nationalism" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nationalism</span></a> and <a href="https://mastodon.social/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a> are important factors for war and killing. I try to treat living beings as <a href="https://mastodon.social/tags/individuals" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>individuals</span></a>.</p>
Jim Donegan 🎵 ✅<p><a href="https://mastodon.scot/tags/STARTREK" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STARTREK</span></a> <a href="https://mastodon.scot/tags/LogicalThinking" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LogicalThinking</span></a> #70 - Proof By Example (Inappropriate <a href="https://mastodon.scot/tags/Generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Generalization</span></a>)</p><p><a href="https://www.youtube.com/watch?v=NjntoaujuF0" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">youtube.com/watch?v=NjntoaujuF</span><span class="invisible">0</span></a></p><p><a href="https://mastodon.scot/tags/Trek" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Trek</span></a> <a href="https://mastodon.scot/tags/LogicalThinking" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LogicalThinking</span></a> <a href="https://mastodon.scot/tags/Philosophy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Philosophy</span></a> <a href="https://mastodon.scot/tags/Spock" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Spock</span></a> <a href="https://mastodon.scot/tags/Enterprise" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Enterprise</span></a> <a href="https://mastodon.scot/tags/TAS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TAS</span></a> <a href="https://mastodon.scot/tags/StarTrekTAS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>StarTrekTAS</span></a> <a href="https://mastodon.scot/tags/TheAnimatedSeries" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TheAnimatedSeries</span></a></p>
Habr<p>Can Transformers "Think"?</p><p>Recent research shows that transformer models can almost flawlessly solve tasks requiring several logical steps: for example, deriving B from statement A and then reasoning on to C. Surprisingly, this is achieved without Chain-of-Thought or special prompts, using only a classic GPT-2. Let's look at how transformers "think" when solving reasoning tasks, and write code for this using the Hugging Face library.</p><p><a href="https://habr.com/ru/articles/840136/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">habr.com/ru/articles/840136/</span><span class="invisible"></span></a></p><p><a href="https://zhub.link/tags/GPT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GPT</span></a> <a href="https://zhub.link/tags/%D0%B3%D1%80%D0%BE%D0%BA%D0%B8%D0%BD%D0%B3" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>grokking</span></a> <a href="https://zhub.link/tags/%D0%BF%D0%B0%D0%BC%D1%8F%D1%82%D1%8C_%D0%98%D0%98" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI_memory</span></a> <a href="https://zhub.link/tags/%D0%B7%D0%B0%D0%B4%D0%B0%D1%87%D0%B8_%D1%80%D0%B0%D1%81%D1%81%D1%83%D0%B6%D0%B4%D0%B5%D0%BD%D0%B8%D1%8F" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>reasoning_tasks</span></a> <a href="https://zhub.link/tags/%D0%BE%D0%B1%D1%89%D0%B8%D0%B9_%D0%B8%D1%81%D0%BA%D1%83%D1%81%D1%81%D1%82%D0%B2%D0%B5%D0%BD%D0%BD%D1%8B%D0%B9_%D0%B8%D0%BD%D1%82%D0%B5%D0%BB%D0%BB%D0%B5%D0%BA%D1%82" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>artificial_general_intelligence</span></a> <a href="https://zhub.link/tags/%D0%BE%D0%B1%D0%BE%D0%B1%D1%89%D0%B5%D0%BD%D0%B8%D0%B5" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a> <a href="https://zhub.link/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a> <a href="https://zhub.link/tags/%D1%82%D1%80%D0%B0%D0%BD%D1%81%D1%84%D0%BE%D1%80%D0%BC%D0%B0%D1%82%D0%BE%D1%80" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>transformer</span></a> <a href="https://zhub.link/tags/%D0%BF%D0%B0%D0%BC%D1%8F%D1%82%D1%8C_%D1%82%D1%80%D0%B0%D0%BD%D1%81%D1%84%D0%BE%D1%80%D0%BC%D0%B5%D1%80%D0%BE%D0%B2" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>transformer_memory</span></a></p>
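The two-hop task format the Habr post describes (derive B from statement A, then reach C) can be sketched in plain Python. The entities and relations below are made up for illustration; the article itself trains GPT-2 via Hugging Face to answer such composed queries implicitly, rather than looking them up.

```python
# Toy knowledge base of atomic facts: (entity, relation) -> entity.
# All names here are hypothetical illustration, not from the article.
facts = {
    ("Alice", "mother"): "Beth",
    ("Beth", "employer"): "AcmeCo",
    ("Alice", "employer"): "Initech",
}

def two_hop(entity, rel1, rel2, kb):
    """Answer 'rel2 of (rel1 of entity)': A -> B -> C."""
    mid = kb.get((entity, rel1))                  # step 1: A -> B
    return kb.get((mid, rel2)) if mid else None   # step 2: B -> C
```

For example, `two_hop("Alice", "mother", "employer", facts)` composes two atomic facts to reach "AcmeCo"; a grokked transformer is interesting precisely because it learns to perform this composition internally, in one forward pass.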
Matthias Nau<p>#8<br>The benefits of <a href="https://neuromatch.social/tags/Multitask" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Multitask</span></a> studies are huge!</p><p>Most importantly, they allow testing the prevalent assumption of <a href="https://neuromatch.social/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a>, yielding results with a high chance of generalizing beyond the lab. What's more, they even enable the discovery of *new concepts*!</p>
JMLR<p>'Three-Way Trade-Off in Multi-Objective Learning: Optimization, Generalization and Conflict-Avoidance', by Lisha Chen, Heshan Fernando, Yiming Ying, Tianyi Chen.</p><p><a href="http://jmlr.org/papers/v25/23-1287.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">http://</span><span class="ellipsis">jmlr.org/papers/v25/23-1287.ht</span><span class="invisible">ml</span></a> <br> <br><a href="https://sigmoid.social/tags/objectives" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>objectives</span></a> <a href="https://sigmoid.social/tags/objective" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>objective</span></a> <a href="https://sigmoid.social/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a></p>
Ralph Straumann (@rastrau)<p><span class="h-card" translate="no"><a href="https://urbanists.social/@markstos" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>markstos</span></a></span> Impressive work. Connectivity, to me, implies network / topological metrics. I’ve experimented a bit with betweenness centrality (<a href="https://en.wikipedia.org/wiki/Betweenness_centrality" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">en.wikipedia.org/wiki/Betweenn</span><span class="invisible">ess_centrality</span></a>) in Python and found it promising (also, e.g., for <a href="https://swiss.social/tags/network" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>network</span></a> <a href="https://swiss.social/tags/generalization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>generalization</span></a>). However, it’s computationally expensive. <a href="https://swiss.social/tags/gis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gis</span></a></p>
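For readers curious about the metric mentioned above, here is a minimal stdlib-only sketch of betweenness centrality on a small unweighted graph (the toy graph is made up). In practice one would use networkx's `betweenness_centrality`, which implements the much faster Brandes algorithm; the brute-force version below makes the definition explicit and also shows why the computation gets expensive on large networks.

```python
from collections import deque
from itertools import combinations

def shortest_paths(adj, s, t):
    """Enumerate all shortest paths from s to t (unweighted graph)."""
    dist = {s: 0}
    q = deque([s])
    while q:                          # BFS for distances from s
        u = q.popleft()
        for w in adj[u]:
            if w not in dist:
                dist[w] = dist[u] + 1
                q.append(w)
    if t not in dist:
        return []
    paths = []
    def walk(node, tail):             # walk back from t along dist-1 edges
        if node == s:
            paths.append([s] + tail)
            return
        for w in adj[node]:
            if dist.get(w) == dist[node] - 1:
                walk(w, [node] + tail)
    walk(t, [])
    return paths

def betweenness(adj):
    """For each node, sum over node pairs of the fraction of
    pairwise shortest paths that pass through it."""
    score = {v: 0.0 for v in adj}
    for s, t in combinations(adj, 2):
        paths = shortest_paths(adj, s, t)
        if not paths:
            continue
        for v in adj:
            if v in (s, t):
                continue
            through = sum(1 for p in paths if v in p)
            score[v] += through / len(paths)
    return score

# Toy path graph A-B-C-D: the interior nodes carry all the traffic.
adj = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C"]}
```

On this path graph, B and C each lie on the unique shortest path of two node pairs (score 2.0), while the endpoints score 0.0. The all-pairs enumeration is what makes this expensive, as the post notes.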