pandas groupby sort within groups

Solution 1:

You could also just do it in one go, by doing the sort first and using head to take the first 3 of each group.

In[34]: df.sort_values(['job','count'],ascending=False).groupby('job').head(3)

Out[35]: 
   count     job source
4      7   sales      E
2      6   sales      C
1      4   sales      B
5      5  market      A
8      4  market      D
6      3  market      B

Solution 2:

What you want to do is actually again a groupby (on the result of the first groupby): sort and take the first three elements per group.

Starting from the result of the first groupby:

In [60]: df_agg = df.groupby(['job','source']).agg({'count':sum})

We group by the first level of the index:

In [63]: g = df_agg['count'].groupby('job', group_keys=False)

Then we want to sort ('order') each group and take the first three elements:

In [64]: res = g.apply(lambda x: x.sort_values(ascending=False).head(3))

However, for this, there is a shortcut function to do this, nlargest:

In [65]: g.nlargest(3)
Out[65]:
job     source
market  A         5
        D         4
        B         3
sales   E         7
        C         6
        B         4
dtype: int64

So in one go, this looks like:

df_agg['count'].groupby('job', group_keys=False).nlargest(3)

Converting Go struct to JSON

How to get CSS to select ID that begins with a string (not in Javascript)?

How to evaluate http response codes from bash/shell script?

How to write an async method with out parameter?

Show that the sequence $\sqrt{2}, \ \sqrt{2+\sqrt{2}}, \ \sqrt{2+\sqrt{2+\sqrt{2+}}}...$ converges and find its limit. [duplicate]

Principal period of $\sin\frac{3x}{4}+\cos\frac{2x}{5}$ [duplicate]

if $f'(x)\rightarrow L$ as $ x \rightarrow \infty$, $-\infty \leq L \leq \infty $ then $ f(x)/x \rightarrow L $ as $x \rightarrow \infty$ [duplicate]

The eigenvalues and corresponding eigenvectors for the matrix

Integral of $\sin x \cos x$ using two methods differs by a constant? [duplicate]

Using induction to prove that $\sum_{r=1}^n r\cdot r! =(n+1)! -1$

Complete balanced (signed) residue System of an odd modulo m

Explicit isomorphism between the four dimensional orthogonal Lie algebra and the direct sum of special linear Lie algebras of dimension 3.

pandas groupby sort within groups

Solution 1:

Solution 2:

Related

Recent Posts