Rang

From Edgy Parakeet, 4 Months ago, written in Plain Text, viewed 70 times.
URL http://codebin.org/view/ede0e972
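# Assumes df has a 'user_id' column and a datetime64 'event_time' column.
# Sort so that each user's events are in chronological order.
df_sort = df.sort_values(['user_id', 'event_time'])
# Gap to the same user's previous event, in seconds (NaN for a user's first event).
# Keep df_sort's index (no reset_index) so the mask below aligns row for row,
# and use total_seconds() because .seconds drops the days part of a gap.
diff_seconds = df_sort.groupby('user_id')['event_time'].diff().dt.total_seconds()

# A new session starts at a user's first event or after more than 600 s of inactivity.
new_session = diff_seconds.isnull() | (diff_seconds > 600)
# Number the session starts per user, then carry each number forward over the rest of the session.
df_sort['session_id'] = df_sort.loc[new_session].groupby('user_id')['event_time'].rank(method='first').astype(int)
df_sort['session_id'] = df_sort['session_id'].ffill().astype(int)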
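
A minimal, self-contained sketch of the same sessionization idea on made-up data, in case a runnable example helps. The function name `assign_sessions`, the `timeout_s` parameter, and the sample events are all hypothetical and not part of the paste. It also swaps the rank-and-forward-fill step for a per-user cumulative sum of the "new session" flag, which should give the same numbering under the assumption that `event_time` is a datetime column.

```python
import pandas as pd


def assign_sessions(df, timeout_s=600):
    """Return a sorted copy of df with a per-user, 1-based session_id column.

    Hypothetical helper: assumes columns 'user_id' and 'event_time' (datetime64).
    """
    out = df.sort_values(['user_id', 'event_time']).copy()
    # Seconds since the same user's previous event; NaN on each user's first event.
    gap = out.groupby('user_id')['event_time'].diff().dt.total_seconds()
    # A session starts at a user's first event or after a gap longer than timeout_s.
    new_session = gap.isna() | (gap > timeout_s)
    # Counting the session starts seen so far per user yields the session number.
    out['session_id'] = new_session.astype(int).groupby(out['user_id']).cumsum()
    return out


# Made-up sample events, deliberately unsorted.
events = pd.DataFrame({
    'user_id': [2, 1, 1, 1, 2, 1],
    'event_time': pd.to_datetime([
        '2021-01-01 09:00:00',
        '2021-01-01 10:00:00',
        '2021-01-01 10:05:00',
        '2021-01-01 10:30:00',
        '2021-01-02 09:05:00',  # one day and 5 minutes after user 2's first event
        '2021-01-01 10:31:00',
    ]),
})

sessions = assign_sessions(events)
print(sessions)
```

User 2's second event is more than a day after the first, so it opens a new session; reading the gap with `.seconds` instead of `total_seconds()` would see only 300 seconds and keep it in the first session.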
