[問題] 用groupby()做累加

看板Python作者luenchang (luen)時間9年前 (2015/03/25 18:29)推噓1(1推 0噓 4→)

留言5則, 3人參與討論串1/2 (看更多)

不好意思，爬文還沒看到有用groupby這個function累加的例子。我的資料是 a list of lists. 每個list裡有zipcode, date及revenue.例如大安區在今年1月1日的銷售是100元 [106,20140101,100].我想做兩件事，一是先把銷售在每個zipcode的每個月加總起來。這部分已經做完了(step 2)。另一個是把月加總累加起來。這部分(step 3) 就不知道該如何做了。程式改了半天還是從第一筆資料累加到最後一筆。但我需要的是在各個zipcode內的累加。為了方便看資料我把它換行排成by column. Python3 有一個accumulate()好像不錯用，但我的版本是2.7.9。可能step 3小小改動就可以得到 desired output. #step1: some mock data mock=[[106,201501,100], [106,201501,200], [106,201502,300], [106,201502,400], [220,201502,200], [220,201502,300], [220,201503,400], [220,201503,500]] #desired output: [[106,201501,300], [106,201502,1000], [220,201502,500], [220,201502,1400]] #step2: sum up revenue for each zipcode and month using groupby() testlist=[] for key, group in groupby(mock, lambda x: str(x[0])+ str(x[1])[0:6]): summation = sum ([ x[2] for x in group]) # monthly sum testlist.append([key, summation]) print testlist #step 3: accumulate monthly summed revenue over month for each zipcode test2=list(zip(*testlist)[1]) print "test2:" print test2 for key, group in groupby(mock, lambda x: str(x[0])): for index, value in enumerate(test2): temp=test2[:index+1] testlist[index].append(reduce(lambda a,b: a+b, temp)) print "another test2:" print testlist -- ※ 發信站: 批踢踢實業坊(ptt.cc), 來自: 60.248.164.177 ※ 文章網址: https://www.ptt.cc/bbs/Python/M.1427279388.A.B26.html

→

luenchang

03/25 18:37, , 1^F

03/25 18:37, 1^F

→

luenchang

03/25 18:38, , 2^F

03/25 18:38, 2^F

→

ccwang002

03/26 01:20, , 3^F

03/26 01:20, 3^F