I’m sure there are some AI peeps here. Neural networks scale with size in part because the number of parameter settings that solve a given task grows combinatorially (exponentially, or even factorially) with network size. How can such a network be properly aligned when even humans, the most advanced natural neural nets, are not aligned? What can we realistically hope for?
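One concrete place factorial growth comes from is permutation symmetry: relabeling the hidden units in a layer (along with their weights) leaves the network's function unchanged, so every "solution" exists in at least `width!` equivalent guises. A toy back-of-the-envelope in Python, counting that one symmetry only, nothing about real loss landscapes:

```python
import math

# Toy count: permuting the units of one hidden layer (with their weights)
# leaves the function unchanged, so each solution has >= width! equivalents.
for width in (8, 64, 512):
    digits = len(str(math.factorial(width)))
    print(f"width={width:4d}: width! has {digits} decimal digits")
```

(Symmetry counting alone doesn't explain why bigger networks generalize; it's only meant to show where factorial growth can come from.)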
Here’s what I mean by alignment:
- The ability to specify a loss function that captures what humanity wants
- Some strict or statistical guarantees on deviation from that loss function, as well as on potentially unaccounted-for side effects (see the sketch after this list)
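To make those two bullets concrete, here's a minimal Python sketch assuming a hypothetical composite objective and a hand-rolled deviation check; every name in it (`aligned_loss`, `side_effect_score`, `deviation_within`) is made up for illustration, not a real API:

```python
import statistics

# Hypothetical composite objective: task performance plus a weighted penalty
# for measured side effects; `lam` trades off the two terms.
def aligned_loss(task_loss: float, side_effect_score: float, lam: float = 10.0) -> float:
    return task_loss + lam * side_effect_score

# A (very weak) statistical "guarantee": mean observed loss stays within
# `tol` of the intended target. A real guarantee would need concentration
# bounds and adversarial evaluation, not a point estimate like this.
def deviation_within(samples: list[float], target: float, tol: float) -> bool:
    return abs(statistics.fmean(samples) - target) <= tol

# Usage: score a few evaluation episodes against the intended objective.
episodes = [aligned_loss(0.21, 0.01), aligned_loss(0.18, 0.00), aligned_loss(0.25, 0.03)]
print(deviation_within(episodes, target=0.3, tol=0.2))  # True -> within tolerance
```

The point is only that both bullets are formalizable in principle; actually finding such a loss function and such a bound is the open problem.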
Does it, though? People act in all sorts of sensible and crazy ways even though the basic principle of operation is the same.
What loss function do you want AI to align on?
If I have a language model AI and an AI designed to function as a nurse, what are they going to align on?