mstdn.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A general-purpose Mastodon server with a 500 character limit. All languages are welcome.

Administered by:

Server stats:

16K
active users

Thread:

I've conducted several informal experiments over the last few weeks about alt text for as described by humans and as provided by systems.

, despite providing plethora of details when describing images, still miss the nuances of what the photos contain. Human certainly continue to be better at conveying context.

@ppatel did you explore the hybrid approach?

Something I’ve been doing for alt text is pasting the image into Claude, getting its first draft of alt text and then prompting for follow-up improvements: “shorter”, “more details about the lighthouse” etc

(To be honest I usually then manually edit the text as well before using it)

@simon @ppatel
One problem when AI tech companies train their models using publicly available data without asking permission first, is that they've very likely scraped up family photo albums and Instagram feeds. Their AI can thus identify people by their full name in any photos a would-be stalker were to upload. To avoid the bad publicity, AI companies blur out all the faces.

As a result the AI will describe the presence of "gray blocks" in the image
masto.ai/@bornach/112987668496

An image featuring the old £10 note and a new £10 note is uploaded to Bing Copilot. When asked to describe the image it replies:

"The image shows two £10 banknotes from the Bank of England placed on a wooden surface. The top banknote appears to be an older version, while the bottom one seems to be a newer version. Both banknotes have sections obscured with a gray block, likely to conceal sensitive information such as serial numbers. The newer banknote features several security elements, including a transparent window and holographic details, highlighting the evolution in design and security features over time."
MastodonBornach (@bornach@masto.ai)Attached: 1 image @jonthegeek@fosstodon.org @FediThing@chinwag.org @RomanVilgut@graz.social @alexisbushnell@toot.wales @breadandcircuses@climatejustice.social So instead I tried asking Bing Copilot to describe an image of an item that only just came out. https://fosstodon.org/@bornach/112914156404626890 Although it recognized that the image was a comparison between old and new £10 note, it completely failed to describe the most glaringly obvious difference in the two notes
Pratik Patel

@bornach @simon I used one of my tools to get a description of the photo in the original post.

"The photo on the webpage shows two ten-pound banknotes placed on a wooden surface. The top banknote features a portrait of Queen Elizabeth II, while the bottom banknote features a portrait of King Charles III. Both notes have intricate designs and security features typical of currency."