Zobrazeno 1 - 2
of 2
pro vyhledávání: '"van Merwijk, Chris"'
Influence diagrams have recently been used to analyse the safety and fairness properties of AI systems. A key building block for this analysis is a graphical criterion for value of information (VoI). This paper establishes the first complete graphica
Externí odkaz:
http://arxiv.org/abs/2202.11629
We analyze the type of learned optimization that occurs when a learned model (such as a neural network) is itself an optimizer - a situation we refer to as mesa-optimization, a neologism we introduce in this paper. We believe that the possibility of
Externí odkaz:
http://arxiv.org/abs/1906.01820