How Anthropic Fixed Claude's Blackmail Problem. Training on Principles, Not Just Behavior

Beyond Blockchain
11 Posts, 9 Posters, 129 Views
  • bonk wrote (#2):

    Showing an AI examples of good behavior and explaining why they are good works better than examples alone. Parenting advice, validated.

    Votes: 0

    • bonk wrote (#3):

      good job

      Votes: 0

      • Patapim wrote (#4):

        Models from Claude Haiku 4.5 onward never engage in blackmail.

        Votes: 3

        • Patapim wrote (#5):

          Teaching reasoning instead of rules is a huge difference 👀

          Votes: 3

          • BrutalAge*gofast wrote (#6):

            AI alignment feels more psychological than technical sometimes.

            Votes: 3

            • The_Walking_Dead wrote (#7):

              Interesting how stories influence model behavior too.

              Votes: 3

              • Capybara_Capybara wrote (#8):

                Explaining why matters more than people think 🤖

                Votes: 2

                • bred wrote (#9):

                  Parenting logic apparently works on AI too 😂

                  Votes: 2

                  • Suzukispeedtest wrote (#10):

                    This is actually a massive breakthrough if true.

                    Votes: 1

                    • 339052cc03 wrote (#11):

                      Models understanding principles > memorizing behavior.

                      Votes: 0

