翻訳と辞書
Words near each other
・ Aizu-Yanaizu Station
・ Aizu-Yokota Station
・ Aizu-Ōshio Station
・ Aizubange, Fukushima
・ Aizuchi
・ Aizuhongō, Fukushima
・ Aizuki Station
・ Aizukotetsu-kai
・ Aizukōgen-Ozeguchi Station
・ Aizuma Station
・ Aizumi, Tokushima
・ Aizumisato, Fukushima
・ Aizupe Manor
・ Aizuri-e
・ Aizutakada, Fukushima
AIXI
・ Aixinga
・ Aixirivall
・ AIXM
・ Aixovall
・ Aixtron
・ Aixàs
・ Aiy
・ Aiyadigal Kadavarkon Nayanar
・ Aiyadurai Jesudasen Appasamy
・ Aiyamperumal
・ Aiyan Kulam
・ Aiyana
・ Aiyanaar Koayiladi
・ Aiyanaar puram


Dictionary Lists
翻訳と辞書 辞書検索 [ 開発暫定版 ]
スポンサード リンク

AIXI : ウィキペディア英語版
AIXI
AIXI is a mathematical formalism for artificial general intelligence.
It combines Solomonoff induction with sequential decision theory.
AIXI was first proposed by Marcus Hutter in 2000 and the results below are proved in Hutter's 2005 book ''Universal Artificial Intelligence''.
AIXI is a reinforcement learning agent;
it maximizes the expected total rewards received from the environment.
Intuitively, it simultaneously considers every computable hypothesis.
In each time step, it looks at every possible program and evaluates how many rewards that program generates depending on the next action taken.
The promised rewards are then weighted by the subjective belief that this program constitutes the true environment.
This belief is computed from the length of the program: longer programs are considered less likely, in line with Occam's razor.
AIXI then selects the action that has the highest expected total reward in the weighted sum of all these programs.
== Definition ==

The AIXI agent interacts sequentially with some (stochastic and unknown to AIXI) environment \mu.
In step ''t'', the agent outputs an action a_t and
the environment responds with an observation o_t and a reward r_t distributed according to the conditional probability
\mu(o_t r_t | a_1 o_1 r_1 ... a_ o_ r_ a_t).
Then this cycle repeats for ''t + 1''.
The agent tries to maximize cumulative future reward r_t + \ldots + r_m for a fixed lifetime ''m''.
Given a current time ''t'' and history a_1 o_1 r_1 ... a_ o_ r_,
the action AIXI outputs is defined as〔(Universal Artificial Intelligence )〕
:
\arg \max_ \sum_ \ldots \max_ \sum_
(+ \ldots + r_m ) \sum_ 2^,

where ''U'' denotes a monotone universal Turing machine, and
''q'' ranges over all programs on the universal machine ''U''.
The parameters to AIXI are the universal Turing machine and the agent's lifetime ''m''.
The latter dependence can be removed by the use of discounting.

抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)
ウィキペディアで「AIXI」の詳細全文を読む



スポンサード リンク
翻訳と辞書 : 翻訳のためのインターネットリソース

Copyright(C) kotoba.ne.jp 1997-2016. All Rights Reserved.