Author lookup / dulcet
Comments (pushes) by author dulcet on the PTT [ VideoCard ] board, 11 in total
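1F push: num1 and num2 live in registers, so they will not conflict.  06/18 17:41
2F →: You must have put __shared__ in front of num1 and num2. Remove it and it will be OK.  06/18 17:42
4F push: @@ How could a vector sum possibly use up all the registers?  06/18 17:46
5F →: You can reuse registers that you used earlier.  06/18 17:47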
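
A minimal CUDA sketch of the point made in 1F and 2F, assuming the original code was an element-wise vector sum (the original post is not shown here). The kernel name `vectorSumRegisters`, the host helper `vectorSum`, and the use of `float` are assumptions for illustration; only `num1`, `num2`, and `__shared__` come from the comments.

```cuda
#include <cstddef>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernel, not the original poster's code.
__global__ void vectorSumRegisters(const float* a, const float* b, float* c, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        // Plain locals: every thread gets its own private copy, normally held
        // in registers, so threads cannot interfere with each other here.
        // Writing `__shared__ float num1, num2;` instead would create a single
        // copy per block that all threads in the block overwrite.
        float num1 = a[i];
        float num2 = b[i];
        c[i] = num1 + num2;
    }
}

// Hypothetical host helper: copies the inputs to the GPU, runs the kernel,
// and copies the result back. Assumes a and b have the same length.
// Error checking is omitted to keep the sketch short.
std::vector<float> vectorSum(const std::vector<float>& a, const std::vector<float>& b)
{
    int n = static_cast<int>(a.size());
    std::vector<float> c(n);
    if (n == 0) return c;

    std::size_t bytes = static_cast<std::size_t>(n) * sizeof(float);
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    cudaMalloc(&dA, bytes);
    cudaMalloc(&dB, bytes);
    cudaMalloc(&dC, bytes);
    cudaMemcpy(dA, a.data(), bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dB, b.data(), bytes, cudaMemcpyHostToDevice);

    int threads = 256;
    int blocks = (n + threads - 1) / threads;  // round up so every element is covered
    vectorSumRegisters<<<blocks, threads>>>(dA, dB, dC, n);

    // This copy waits for the kernel to finish before reading the result.
    cudaMemcpy(c.data(), dC, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
    return c;
}
```

With per-thread locals like these, the kernel needs only a handful of registers per thread, which fits the remark in 4F that a vector sum is unlikely to run out of them.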
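1F push: I don't follow your example. Are you trying to do a vector sum or a sum filter?  06/18 17:14
2F →: A vector sum done directly will not have bank conflicts.  06/18 17:16
3F →: For a sum filter, use shared memory. If you launch as many threads as the number of elements being loaded,  06/18 17:20
4F →: you will not get bank conflicts on global memory.  06/18 17:22
6F push: So it is a sum filter. Then use shared memory.  06/18 17:33
7F →: @@ I misread, it is a vector sum.  06/18 17:33
8F →: Get rid of the for loop and use k = threadIdx.x. That is the right way, isn't it?  06/18 17:35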
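
The suggestion in 8F can be sketched as follows, assuming the original kernel looped over every element with an index `k` (that code is not shown here). The names `vectorSumNoLoop` and `vectorSumSingleBlock` and the single-block launch are assumptions for illustration; only `k = threadIdx.x` comes from the comment.

```cuda
#include <cstddef>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernel: one thread per element, no loop inside the kernel.
__global__ void vectorSumNoLoop(const float* a, const float* b, float* c, int n)
{
    // Assumed original form: for (int k = 0; k < n; ++k) c[k] = a[k] + b[k];
    // Here each thread handles exactly one k instead.
    int k = threadIdx.x;
    if (k < n) {
        c[k] = a[k] + b[k];
    }
}

// Hypothetical host helper for a single block. Assumes a and b have the same
// length. Returns an empty vector if n exceeds 1024, the per-block thread
// limit on current GPUs. Error checking is omitted to keep the sketch short.
std::vector<float> vectorSumSingleBlock(const std::vector<float>& a,
                                        const std::vector<float>& b)
{
    int n = static_cast<int>(a.size());
    if (n == 0 || n > 1024) return std::vector<float>();

    std::vector<float> c(n);
    std::size_t bytes = static_cast<std::size_t>(n) * sizeof(float);
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    cudaMalloc(&dA, bytes);
    cudaMalloc(&dB, bytes);
    cudaMalloc(&dC, bytes);
    cudaMemcpy(dA, a.data(), bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dB, b.data(), bytes, cudaMemcpyHostToDevice);

    vectorSumNoLoop<<<1, n>>>(dA, dB, dC, n);  // one block of n threads

    cudaMemcpy(c.data(), dC, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
    return c;
}
```

For vectors longer than one block, the usual index is `blockIdx.x * blockDim.x + threadIdx.x` with a grid of several blocks. Consecutive threads then read consecutive elements, and no shared memory is involved.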