Tree of Tags

Go Back

You can't go any further

You can't go any further

meritocratic regular democratic

hot top alive

12 posts AI-assisted Alignment

10 posts Ought

151 Godzilla Strategies

johnswentworth

6mo

65

92 Beliefs and Disagreements about Automating Alignment Research

Ian McKenzie

3mo

4

16 Research request (alignment strategy): Deep dive on "making AI solve alignment for us"

JanBrauner

19d

3

16 Discussion on utilizing AI for alignment

elifland

3mo

3

14 Making it harder for an AGI to "trick" us, with STVs

Tor Økland Barstad

5mo

5

10 Provably Honest - A First Step

Srijanak De

1mo

2

9 Getting from an unaligned AGI to an aligned AGI?

Tor Økland Barstad

6mo

7

8 AI-assisted list of ten concrete alignment things to do right now

lcmgcd

3mo

5

8 Sufficiently many Godzillas as an alignment strategy

142857

3mo

3

7 Alignment with argument-networks and assessment-predictions

Tor Økland Barstad

7d

3

6 Infinite Possibility Space and the Shutdown Problem

magfrump

2mo

0

6 Would you ask a genie to give you the solution to alignment?

sudo -i

3mo

1

118 Supervise Process, not Outcomes

stuhlmueller

8mo

8

98 Solving Math Problems by Relay

bgold

2y

26

87 Ought: why it matters and ways to help

paulfchristiano

3y

7

45 Factored Cognition

stuhlmueller

4y

6

42 The Majority Is Always Wrong

Eliezer Yudkowsky

15y

54

35 Ought will host a factored cognition “Lab Meeting”

jungofthewon

3mo

1

29 Update on Ought's experiments on factored evaluation of arguments

Owain_Evans

2y

0

17 Automating reasoning about the future at Ought

jungofthewon

2y

0

14 [AN #86]: Improving debate and factored cognition through human experiments

Rohin Shah

2y

0

4 The Stack Overflow of Factored Cognition

rmoehn

3y

4