Tags similar to: Shard Theory
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Show similar
Human Values
Heuristics & Biases
Internal Alignment (Human)
World Modeling
Rationality
AI
Complexity of Value
Reinforcement Learning
Utility Functions
SERI MATS
Psychology
Outer Alignment
Ontology
Inner Alignment
Reward Functions
Interpretability (ML & AI)
Machine Learning (ML)
Community
Subagents
Research Agendas
Embedded Agency
General Alignment Properties
Wireheading
Value Learning
Mesa-Optimization