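[Question] How to run three kinds of reduce in one Hadoop MapReduce job
My problem: right now I load a large log file into a DB
and then run this SQL:
select att_id, max(timestamp), min(timestamp), count(1) from table_a group by att_id
I want to do this with Hadoop MapReduce instead.
I have already written separate reducers for max, min, and count.
att_id is the map output key. For the first two jobs the map output value is the timestamp wrapped in a LongWritable;
for the last one it is an IntWritable.
But running them as three separate jobs wastes two extra rounds of mapper processing time,
and the three separate outputs then have to be merged afterwards.
Is there a way to do all of this in a single reduce?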
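
One possible direction, sketched under assumptions rather than tested on a cluster: since all three aggregates are computed over the same timestamps for the same att_id, the mapper could emit (att_id, timestamp) once, and a single reducer could track max, min, and count in one loop over the values. The plain-Java sketch below shows only that loop. `MinMaxCount` is a hypothetical helper name, not a Hadoop class, and `Long` stands in for `LongWritable` so the snippet runs without Hadoop on the classpath.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper (not part of Hadoop): holds the three aggregates for one key.
final class MinMaxCount {
    long min = Long.MAX_VALUE;
    long max = Long.MIN_VALUE;
    long count = 0;

    // Fold one timestamp into the running aggregates.
    void add(long timestamp) {
        if (timestamp < min) min = timestamp;
        if (timestamp > max) max = timestamp;
        count++;
    }

    // Single pass over every timestamp grouped under one att_id.
    // In a real job this loop would sit inside the reducer's reduce() method.
    static MinMaxCount reduce(Iterable<Long> timestamps) {
        MinMaxCount result = new MinMaxCount();
        for (long ts : timestamps) {
            result.add(ts);
        }
        return result;
    }

    // Tab-separated columns in the same order as the SQL: max, min, count.
    @Override
    public String toString() {
        return max + "\t" + min + "\t" + count;
    }

    public static void main(String[] args) {
        // Made-up sample timestamps for a made-up key "att_1".
        List<Long> timestamps = Arrays.asList(1281423240L, 1281425400L, 1281420000L);
        System.out.println("att_1\t" + MinMaxCount.reduce(timestamps));
    }
}
```

In an actual job the same loop would presumably go inside `reduce(Text key, Iterable<LongWritable> values, Context context)`, calling `get()` on each value and writing the three results as one `Text` value. With that layout the separate IntWritable output for count is no longer needed, because the count is simply the number of timestamps seen for the key.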
--
※ Origin: 批踢踢實業坊 (ptt.cc)
◆ From: 210.69.230.190