Re: [情報] 光線追蹤太耗資源：《古墓奇兵：暗影》開

看板PC_Shopping作者arrenwu (Colors Guardian)時間5年前 (2018/08/22 08:06)推噓20(20推 0噓 28→)

留言48則, 21人參與討論串3/3 (看更多)

: 推 a2935373 : 然後在隔壁版看到V100好像還是很搶手懷疑老黃這次 08/22 03:18 : → a2935373 : 是為了出給專業公司順便炒作一下遊戲來交代股東不 08/22 03:18 : → a2935373 : 然RTX這幾張真的怎麼看都不像遊戲用 08/22 03:18 講到這個V100 我朋友最近分享一個 ML Benchmark Result 給我 https://github.com/u39kun/deep-learning-benchmark 先看一下 V100 和 1080 Ti 的規格差異 Model Memory CUDA Cores Tensor Cores Tesla V100 16GB HBM2 5120 640 1080 Ti 11GB GDDR5 3584 0 V100 boosted Frequency 1455 MHz 1080Ti boosted Frequency: 1582 MHz 測試結果 PyTorch 0.3.0 1080 Ti 精度 vgg16 eval vgg16 train resnet152 eval resnet152 train 32-bit 39.3ms 131.9ms 57.8ms 206.4ms 16-bit 33.5ms 117.6ms 46.9ms 193.5ms V100 精度 vgg16 eval vgg16 train resnet152 eval resnet152 train 32-bit 26.2ms 83.5ms 38.7ms 136.5ms 16-bit 12.6ms 58.8ms 21.7ms 92.9ms Tensorflow 1.4.0 1080 Ti 精度 vgg16 eval vgg16 train resnet152 eval resnet152 train 32-bit 43.4ms 131.3ms 69.6ms 300.6ms 16-bit 38.6ms 121.1ms 53.9ms 257.0ms Tensorflow 1.5.0 V100 精度 vgg16 eval vgg16 train resnet152 eval resnet152 train 32-bit 24.0ms 71.7ms 39.4ms 199.8ms 16-bit 13.6ms 49.4ms 22.6ms 147.4ms V100 TDP 300W 1080Ti TDP 275W 這樣看下來，Tensor Core 的強是表現在能耗比上面 V100 一張要 $8900 @@" -- 「保護這個城市的我，不存在弱點。 ...遊戲玩很爛...？別說了......拜託你別再說了！！！」～琴葉 https://i.imgur.com/7JHnwBV.jpg

-- ※ 發信站: 批踢踢實業坊(ptt.cc), 來自: 73.158.52.60 ※ 文章網址: https://www.ptt.cc/bbs/PC_Shopping/M.1534896403.A.616.html

→

F04E

08/22 08:10, 5年前 , 1^F

08/22 08:10, 1^F

推

atrix

08/22 08:15, 5年前 , 2^F

08/22 08:15, 2^F

這部分是有差距我補上數據

推

CactusFlower

08/22 08:31, 5年前 , 3^F

08/22 08:31, 3^F

→

jeff40108

08/22 08:49, 5年前 , 4^F

08/22 08:49, 4^F

→

jeff40108

08/22 08:50, 5年前 , 5^F

08/22 08:50, 5^F

推

otaku690

08/22 08:52, 5年前 , 6^F

08/22 08:52, 6^F

是這個沒錯不好意思我以為我剛剛就貼上去了 @@"

→

otaku690

08/22 08:56, 5年前 , 7^F

08/22 08:56, 7^F

https://www.tomshardware.com/news/nvidia-tensor-core-tesla-v100,34384.html 從這篇文章裡面的說法， According to Nvidia, V100’s Tensor Cores can provide 12x the performance of FP32 operations on the previous P100 accelerator, as well as 6x the performance of P100’s FP16 operations. 會沒用嗎？況且如果Benchmark沒有亂寫的話，這個測試應該就類似一般使用吧？

→

kuma660224

08/22 09:04, 5年前 , 8^F

08/22 09:04, 8^F

→

kuma660224

08/22 09:05, 5年前 , 9^F

08/22 09:05, 9^F

效能看起來跟CUDA的數量一致，但是兩者功耗差不多喔

推

hcwang1126

08/22 09:08, 5年前 , 10^F