[新聞] Google AI史上首次戰勝人類圍棋選手 - Gossiping板

Disp BBS guest 註冊登入(i) 線上人數: 716

首頁(home) 上頁(↑) 下頁(↓) 末頁(end)

※ 本文為 MindOcean 轉寄自 ptt.cc 更新時間: 2016-01-28 22:31:44

看板 Gossiping

作者 snowpoint (暱稱)
標題 [新聞] Google AI史上首次戰勝人類圍棋選手
時間 Thu Jan 28 18:47:30 2016

原文網址：http://goo.gl/zQgUVx

Google AI algorithm masters ancient game of Go : Nature News & Comment

Deep-learning software defeats human professional for first time. ...

(Nature)

Google AI algorithm masters ancient game of Go
Google AI 精通了古老的圍棋遊戲

Deep-learning software defeats human professional for first time.
深度學習軟體史上首次擊敗人類職業選手

A computer has beaten a human professional for the first time at
Go — an ancient board game that has long been viewed as
one of the greatest challenges for artificial intelligence (AI).

長久以來，圍棋被視為AI領域最艱難的挑戰之一，
如今電腦首次成功擊敗了人類職業選手

The best human players of chess, draughts and backgammon have all been
outplayed by computers. But a hefty handicap was needed for computers to win at Go.
Now Google’s London-based AI company, DeepMind, claims that
its machine has mastered the game.

在西洋棋、西洋跳棋與雙陸棋領域，人類早已不敵電腦。
但電腦要在圍棋獲得勝利有很大的障礙。
而現在，Google在2014年買下的倫敦DeepMind 公司，聲稱已精通這個遊戲。

DeepMind’s program AlphaGo beat Fan Hui, the European Go champion,
five times out of five in tournament conditions, the firm reveals in research published
in Nature on 27 January1. It also defeated its silicon-based rivals,
winning 99.8% of games against the current best programs.
The program has yet to play the Go equivalent of a world champion,
but a match against South Korean professional Lee Sedol,
considered by many to be the world’s strongest player, is scheduled for March.
“We’re pretty confident,” says DeepMind co-founder Demis Hassabis.

DeepMind 所開發的軟體AlphaGo在標準比賽規則中以
五戰五勝的成績擊敗歐洲圍棋冠軍
Fan Hui，該公司在1月27日出版的《Nature》公開這項消息。
而對上目前頂級的圍棋程式時，則有99.8%的勝率。
這套程式目前還沒有跟世界冠軍對弈過，
但今年三月，它將要與南韓職業選手Lee Sedol對弈，
該位選手被許多人認為是世界最強。
DeepMind的共同創辦人Demis Hassabis對此表示「我們相當有信心」。

“This is a really big result, it’s huge,” says R幦i Coulom, a programmer in
Lille, France, who designed a commercial Go program called Crazy Stone.
He had thought computer mastery of the game was a decade away.

「這結果超猛、超狂」R幦i Coulom這樣說到，他是住在法國Lille的
一位圍棋遊戲程式設計師，他曾寫過一款叫做Crazy Stone的圍棋遊戲程式。
他原以為電腦要主宰圍棋還要數十年。

The IBM chess computer Deep Blue, which famously beat grandmaster
Garry Kasparov in 1997, was explicitly programmed to win at the game.
But AlphaGo was not preprogrammed to play Go: rather,
it learned using a general-purpose algorithm that allowed it to interpret
the game’s patterns, in a similar way to how a DeepMind program
learned to play 49 different arcade games2.

IBM的西洋棋電腦：深藍(Deep Blue)在1997年擊敗西洋棋大師Garry Kasparov，
但深藍是刻意為了西洋棋而寫出來的程式。然而AlphaGo並非為圍棋而設計，
它是利用一套綜合分析演算法在遊戲中學習一款遊戲的規則，
它也用這種方式學了49種不同遊戲(Arcade Games)的玩法。

This means that similar techniques could be applied to other AI domains
that require recognition of complex patterns, long-term planning and
decision-making, says Hassabis. “A lot of the things we’re trying to do in the world
come under that rubric.” Examples are using medical images to make diagnoses
or treatment plans, and improving climate-change models.

這代表其他類似領域的──需要複雜認知、長期規劃與決策的AI，
都可以套用此項技術，Hassabis說到：「世界上許多議題都跟此有關」，
舉例來說，利用醫療影像來診斷、決定療法，以及建造氣候變遷模型。

In China, Japan and South Korea, Go is hugely popular and is even played by celebrity
professionals. But the game has long interested AI researchers because of
its complexity. The rules are relatively simple: the goal is to gain
the most territory by placing and capturing black and white stones on a 199 grid.
But the average 150-move game contains more possible board configurations
— 10^170 — than there are atoms in the Universe,
so it can’t be solved by algorithms that search exhaustively for the best move.

在中國、日本以及南韓，圍棋有很多知名的選手，是相當熱門的遊戲。
而由於這遊戲相當複雜，長期以來一直是AI研究者的興趣。
從規則來說相對的簡單：在黑棋與白棋的包圍戰中，取得最多領土的人獲勝。
但一場圍棋遊戲中，平均有150回合，
這當中包含了 10^170 種可能性，遠比已知宇宙的原子總數還多，
因此它無法用窮舉的方式找出最佳解。

Abstract strategy
抽象謀略

Chess is less complex than Go, but it still has too many possible configurations to
solve by brute force alone. Instead, programs cut down their searches by
looking a few turns ahead and judging which player would have the upper hand.
In Go, recognizing winning and losing positions is much harder:
stones have equal values and can have subtle impacts far across the board.

西洋棋較圍棋相對簡單，但要用暴力算法解決的話，可能性還是太多，
因此，程式只計算未來幾個回合的下法，來判斷誰將會佔上風。
但在圍棋中要判斷誰佔上風困難許多，黑棋跟白棋顆顆等值，
細微的轉變都有可能會影響全局。

To interpret Go boards and to learn the best possible moves, the AlphaGo program

applied deep learning in neural networks — brain-inspired programs in which connections between layers of simulated neurons are strengthened through examples and experience. It first studied 30 million positions from expert games, gleaning abstract information on the state of play from board data, much as other programmes categorize images from pixels. Then it played against itself across 50 computers, improving with each iteration, a technique known as reinforcement learning.

為了要在遊戲進行中找出可能的最佳解，AlphaGo採用了
深度學習類神經網路，這項技術以人腦結構啟發，
它可以模擬多層次神經元網路，

*這些神經元在經過經驗學習後會產生變化，當下次碰到類似的問題時，
可以快速產生解決該問題的模糊反應；
可以理解成──用了XX很多次就會變成XX的形狀這種感覺。
*這邊是譯者個人的註解，讓各位有個簡單的概念

它首先從專業比賽中的3千萬個回合，萃取出遊戲全局狀態的抽象資訊，
類似其他圍棋程式對棋譜進行分類，接著再用50台平行連線電腦
跟它自己對弈，每個回合它都會不斷進化，這項技術被稱作強化學習。

“Deep learning is killing every problem in AI.”
「AI利用深度學習解決所有問題」

The software was already competitive with the leading commercial Go programs, which select the best move by scanning a sample of simulated future games. DeepMind then combined this search approach with the ability to pick moves and interpret Go boards — giving AlphaGo a better idea of which strategies are likely to be successful. The technique is “phenomenal”, says Jonathan Schaeffer, a computer scientist at the University of Alberta in Edmonton, Canada, whose software Chinook solved3 draughts in

2007. Rather than follow the trend of the past 30 years of trying to crack games using computing power, DeepMind has reverted to mimicking human-like knowledge, albeit by training, rather than by being programmed, he says. The feat also shows the power of deep learning, which is going from success to success, says Coulom. “Deep learning is killing every problem in AI.”

現存的商用圍棋程式已經相當有競爭力，它們藉由掃描棋譜，
模擬未來幾個回合來找出最佳解。
而DeepMind還結合了綜觀當下局勢的能力，
讓AlphaGo更容易找出最有可能成功的戰略。
Alberta in Edmonton 大學的電腦科學家Jonathan Schaeffer說：
「這項技術『太神了』」，這人在2007年設計出西洋跳棋的必勝程式。

他認為這跳脫了過去30年利用計算能量暴力破解的方式，
回歸模仿人類的取得知識的方式：藉由經驗學習，而非預先寫好的程式劇本。
Coulom則認為，這也展現出了深度學習的力量，
是一項戰無不勝的技術──「AI利用深度學習解決所有問題」

AlphaGo plays in a human way, says Fan. “If no one told me, maybe I would think the player was a little strange, but a very strong player, a real person.” The program seems to have developed a conservative (rather than aggressive) style, adds Toby Manning, a lifelong Go player who refereed the match.

AlphaGo以人類的方式在玩遊戲，Fan(被幹掉的歐洲冠軍)說到：
「如果沒人跟我說，我會以為他是個有點怪但很強的人類選手」。
一位從小就玩圍棋的玩家Toby Manning對這場圍棋比賽評論到：
「這套程式看起來有自己一套穩健而非侵略的風格」

Google’s rival firm Facebook has also been working on software that uses machine learning to play Go. Its program, called darkforest, is still behind commercial state-of-the-art Go AI systems, according to a November preprint4.

Google的對手公司Facebook也著手在機器學習的圍棋軟體上。
該公司的程式叫做黑森林/darkforest，但根據2015年 11月的資料顯示，
它仍不及市面上頂級的圍棋AI

Hassabis says that many challenges remain in DeepMind’s goal of developing a generalized AI system. In particular, its programs cannot yet usefully transfer their learning about one system — such as Go — to new tasks; a feat that humans perform seamlessly. “We’ve no idea how to do that. Not yet,” Hassabis says.

Hassabis提到，DeepMind仍有相當多的挑戰，他們的目標是開發出
通用/全用途AI。舉例來說，他們的程式還無法將一個領域上學習到的經驗
──例如圍棋，套用到另一個領域上，這對人類來說輕而易舉。
Hassabis說：「我們對此毫無頭緒，但總有一天…」

Go players will be keen to use the software to improve their game, says Manning, although Hassabis says that DeepMind has yet to decide whether it will make a commercial version.

Manning認為圍棋玩家可以用這套軟體鞭策自己，
但Hassabis提到DeepMind還沒決定這套軟體是否要商業化。

AlphaGo hasn’t killed the joy of the game, Manning adds. Strap lines boasting that Go is a game that computers can’t win will have to be changed, he says. “But just because some software has got to a strength that I can only dream of, it’s not going to stop me playing.”

Manning 補充到AlphaGo不會抹殺遊戲的樂趣，
它只是改變了電腦贏不了圍棋這件事，「不過是個軟體，
達到了我夢想成為的強度，我還是會繼續玩圍棋」

--
Sent from my Windows

--
※ 發信站: 批踢踢實業坊(ptt.cc), 來自: 122.146.86.153
※ 文章代碼(AID): #1MgV74kh (Gossiping)
※ 文章網址: https://www.ptt.cc/bbs/Gossiping/M.1453978052.A.BAB.html

推 hermionex: 拿幾次得分王就跩上天了1F 01/28 18:48

推 NovemberShit: 完了魔鬼終結者要誕生了2F 01/28 18:48

推 manlike: 沒屁用3F 01/28 18:49

推 TaricOP: 超狂4F 01/28 18:49

推 coldlian: 自己跟自己對弈www5F 01/28 18:50

推 newwu: R幦i6F 01/28 18:50

→ dklash: 贏亞洲前幾名再來講歐美圍棋水準就像亞洲西洋棋水準一樣7F 01/28 18:50

推 kurtsgm: 用了XX很多次就會變成XX的形狀...你故意這樣翻的吧 XDD8F 01/28 18:50

推 aya16810: 不過是個軟體XDDD9F 01/28 18:51

推 chaoliu: 內文要看他說得很清楚了這不是專為圍棋社季的AI10F 01/28 18:51

推 tkc7: 不過感覺打敗人腦是遲早的事11F 01/28 18:51

→ chaoliu: 他也可以駕馭其他的遊戲很猛12F 01/28 18:51

推 lsc36: 不過是個軟體13F 01/28 18:52

推 jakert123: 人都會失誤圍棋這種要精算幾目地的遊戲 AI能做得比人14F 01/28 18:52

推 bobyhsu: 不過是個軟體15F 01/28 18:52

推 Cervelo1995: 人類可以出怪招, 而且層出不窮XDDD16F 01/28 18:53

推 POWERSERIES: 天網要誕生了17F 01/28 18:53

→ jakert123: 好也不用太意外....18F 01/28 18:53

推 verydisco: 等打敗佐為再說吧19F 01/28 18:53

推 wuming2: 歐洲冠軍? 還好不是日韓20F 01/28 18:54

→ se2422: 天網21F 01/28 18:55

推 orze04: 只是贏歐洲而已？22F 01/28 18:55

推 YQE766: 推翻譯23F 01/28 18:56

→ orze04: 現在圍牆頂尖是中國和南韓24F 01/28 18:56

推 iamstudent: 其實距離做出天網非常近了，把下棋改成兵棋推演...25F 01/28 18:57

推 winken2004: 有讓子嗎?26F 01/28 18:58

推 s93rm6: 沒27F 01/28 18:59

推 scratch01: XD歐洲冠軍28F 01/28 18:59

推 sky0302: 原PO不是圈內人吧李世石 Lee Sedol29F 01/28 18:59

推 woow1225: 車欠骨豐30F 01/28 19:00

推 bj45566: 「超猛、超狂！」 XDDD31F 01/28 19:01

推 orze04: 這個歐洲冠軍好像才二段32F 01/28 19:02

推 alau: 沒人貼終結者BGM33F 01/28 19:02

推 typekid: R幦i 好怪的名字34F 01/28 19:02

推 eric820813: XDDDDD35F 01/28 19:02

推 fujkokwj: 先贏東亞頂尖棋手再來風光..36F 01/28 19:02

→ bj45566: Google 是買下倫敦的 DeepMind，所以英國的 AI 超強？37F 01/28 19:03

→ fujkokwj: 下一場就要對李世石就有參考價值38F 01/28 19:03

→ snowpoint: 嗯，我不會圍棋，但有在做神經網路39F 01/28 19:04

推 e1q3z9c7: 雙陸棋說實在運氣成份很高= =40F 01/28 19:04

噓 dufflin: vision 是妳41F 01/28 19:05

→ Rinehot: 重點根本就不在目算而是盤中的形式判斷42F 01/28 19:05

→ kevin0733: 先打敗張栩再來說43F 01/28 19:05

推 ms0499215: 我圍棋6段我賭人類會贏44F 01/28 19:05

推 oncemore: 好像沒授子跟亞洲的對決不授4子不會贏45F 01/28 19:06

推 andrewyllee: 哇贏李世刀耶這軟體頗強46F 01/28 19:09

→ Rinehot: go版有棋譜有興趣的可以看一下47F 01/28 19:10

→ andrewyllee: 靠邀還沒下贏了再說48F 01/28 19:10

推 mark0912n: 還沒跟李世石下過吧三月才要下歐洲冠軍根本是菜49F 01/28 19:12

推 cdpicker: 佐為附身電腦??50F 01/28 19:12

推 ctes940008: 天網不遠了嗎？51F 01/28 19:13

推 cvn65: 對啦足球打遍亞洲無敵手也可以聲稱精通足球了52F 01/28 19:16

推 lili300: 想到POI裡芬奇教Machine下西洋棋那段53F 01/28 19:17

推 DarkerDuck: 不過就是車欠骨豊54F 01/28 19:18

推 c7683fh6: 天網都是真的55F 01/28 19:19

推 sharb: 塔矢亮跟這AI下會射出來吧56F 01/28 19:20

推 ping870224: yoyodiy有繞過AI的軟體57F 01/28 19:20

推 ksxo: 找黑嘉嘉來讓電腦分心58F 01/28 19:21

推 sky0302: 這比賽的賣點是獎金100萬美金比目前世界冠軍獎金都要高59F 01/28 19:21

推 DarkerDuck: 強AI真的不遠了，deep learning可以應用的領域非常廣60F 01/28 19:21

→ Kermei: 問題是歐洲圍棋冠軍跟台灣足球冠軍..好比某國足球贏台灣61F 01/28 19:23
→ Kermei: 新聞就寫距離贏巴西德國義大利阿根廷不遠了...

推 sky0302: 老實說那個歐洲冠軍幹不過台灣業餘頂尖63F 01/28 19:24

→ keny80206: 時光機快出現了64F 01/28 19:25

推 mystage: 有學習能力的ai, 完了，核彈要發射了65F 01/28 19:26

推 elephanting: 用了XX很多次就會變成XX的形狀這種感覺.............66F 01/28 19:29

推 Lawlans: 很用心的翻譯67F 01/28 19:30

推 followwar: 都說深度學習是下個世紀最紅的技術..68F 01/28 19:33

→ qqq0103 …

推 elle: 我懂了初音也是軟體但她是特別的70F 01/28 19:36

推 tp6b123: 沒人發現用XX幾次就變成XX的形狀嗎?71F 01/28 19:38

→ gameguy: ibm深藍，我困了72F 01/28 19:41

→ goldduck: 能贏五盤一盤未失不容易了73F 01/28 19:42

推 kon0419: 左右互搏的概念74F 01/28 19:50

推 longkiss0618: 軟體75F 01/28 19:53

→ viciousqq: SAI76F 01/28 19:59

推 likeyousmile: 第一手大元嚇嚇它77F 01/28 19:59

→ ffaarr: 贏歐洲冠軍就超強了啦，突破了職棋跟業餘的那道牆78F 01/28 20:03

推 cloud1030: 用了10次，子瑜的形狀79F 01/28 20:07

推 hw1: Google is Skynet80F 01/28 20:08

推 mr680224: 3月和李世石對奕，到時就知道和一流選手的差距了81F 01/28 20:09

推 ilove88th: 藤原佐為表示 :82F 01/28 20:14

推 zelkova: 用了XX很多次就會變成XX的形狀這種感覺 XDDD83F 01/28 20:44

推 kerbi: 雖然說是一個里程碑了但其實要贏了中日韓的頂尖職業選手才84F 01/28 20:47
→ kerbi: 比較有指標性

推 yef7591: 還是個軟體86F 01/28 20:53

推 jeff830621: AI > EU > NA XDDDDDDDD87F 01/28 21:17

推 s9209122222: 看介紹說那個歐洲冠軍是從中國找過去帶隊的職業二段88F 01/28 21:30
→ s9209122222: 棋士，應該不至於輸台灣業餘吧？

推 sky0302: 現役職二都未必能贏台灣業餘頂尖更何況是遠離前線的棋手90F 01/28 21:48
→ sky0302: 台灣新科初段說他和台灣業餘天王下三七開他落下風
→ sky0302: 還有現在段位根本不代表棋力很多退休八九段根本像菜一樣

推 s9209122222: 那些業餘的不往職業發展是因為賺不到錢嗎？93F 01/28 22:10

推 sky0302: 有些因為年齡有些是寧為雞頭有些為了錢94F 01/28 22:17

推 umano: 電腦要在圍棋獲得勝利有很大的障礙95F 01/28 22:18

→ sky0302: 現在兩岸的業餘比賽都很好賺低階職業只能靠教棋過活96F 01/28 22:19

※ 看板: Gossiping　文章推薦值: 0 目前人氣: 0 累積人氣: 2343　

分享網址: 複製

DispBBS

1樓時間: 2016-01-28 21:50:01 (台灣)

→

jengyic

　 (編輯過) TW

等 AI 寫出第一支有用的程式再來叫我

2樓時間: 2016-01-28 21:53:01 (台灣)

→

k123amz

　 01-28 21:53 TW

什麼時候有粗因實體化

3樓時間: 2016-01-28 21:55:48 (台灣)

→

m78a30

　 01-28 21:55 TW

我的學習能力書AI= =

4樓時間: 2016-01-28 23:22:32 (澳大利亞)

→

fuckchina

　 01-28 23:22 AU

deep learning還有很長的路要走 (菸~

5樓時間: 2016-01-28 23:38:59 (台灣)

→

alenfbi

　 01-28 23:38 TW

BMG支援 
Terminator Salvation(Theme song) - YouTube

回到看板(←)《Gossiping》

r)回覆 e)編輯 d)刪除 M)收藏 ^x)轉錄同主題: =)首篇 [)上篇 ])下篇

回列表(←) 分享