stux @stux

Recent searches

Search options

Only available when logged in.

**Erin Kissane** @kissane · Feb 15, 2023

Feb 15, 2023

The speed with which these guys converge on the idea that LLM ~misbehavior—which is about as intelligent as the flailing end of a garden hose with too much water coming out—means that they're really highly intelligent female (of course, of course) entities whose "personality" can be "revealed" is both hilarious and so dispiriting

https://stratechery.com/2023/from-bing-to-sydney-search-as-distraction-sentient-ai/

Stratechery by Ben Thompson · Feb 15, 2023From Bing to SydneyMore on Bing, particularly the Sydney personality undergirding it: interacting with Sydney has made me completely rethink what conversational AI is important for.

**Eleanor Saitta** @dymaxion@infosec.exchange · Feb 15, 2023

Feb 15, 2023

Eleanor Saitta @dymaxion@infosec.exchange

@kissane
Wow that was painful to even skim. I'm assuming that each time stuff got erased or things changed course, that was some poor mod on the other end having to send in instructions out of band to try to stop product from generating terrible output for this guy

Erin Kissane @kissane@mstdn.social

@dymaxion Oh wow, I hadn't even considered a human mod, I'd figured it got sent up the chain to moderation software. That's even worse.

Feb 15, 2023, 11:02 PM·

0boosts·4favorites

**Eleanor Saitta** @dymaxion@infosec.exchange · Feb 15, 2023

Feb 15, 2023

Eleanor Saitta @dymaxion@infosec.exchange

@kissane
I mean, rule one of AI products is that they're actually mostly a bunch of unpaid humans. Given everything we've seen here and how many times Microsoft in particular has been bitten by this issue, I can't imagine they'd launch it without a few thousand moderators riding herd on it, just like midjourney does to try to stop its porn problem. I think they're not entirely wrong to ascribe human intent to the sudden shifts (as opposed to the rest of the generated text), because they're probably exactly that — human intent.

**Erin Kissane** @kissane · Feb 15, 2023

Feb 15, 2023

Erin Kissane @kissane

@dymaxion All true. I keep hoping they've found ways to handle at least baseline moderation that aren't just applying low-wage adrenal systems at scale, but that's certainly naive.

**Eleanor Saitta** @dymaxion@infosec.exchange · Feb 15, 2023

Feb 15, 2023

Eleanor Saitta @dymaxion@infosec.exchange

@kissane I mean, I'm sure it's not just manual — there's obviously a bunch of tuning, and I can only imagine that they're building more complex classifiers for types of problems, etc. as they go. But all systems like that always need manual data tagging at some point at the least to classify what's ok and not and what types of corrective action should happen. If nothing else, they can probably classify the kinds of conversations like this one that involve someone poking at the response envelope and route those to mods.

Drag & drop to upload

Recent searches

Search options

Administered by:

Server stats:

Recent searches

Search options

Administered by:

Server stats:

Back