Row-wise average for a subset of columns with missing values

Solution 1:

You can simply:

df['avg'] = df.mean(axis=1)

       Monday  Tuesday  Wednesday        avg
Mike       42      NaN         12  27.000000
Jenna     NaN      NaN         15  15.000000
Jon        21        4          1   8.666667

because .mean() ignores missing values by default: see docs.

To select a subset, you can:

df['avg'] = df[['Monday', 'Tuesday']].mean(axis=1)

       Monday  Tuesday  Wednesday   avg
Mike       42      NaN         12  42.0
Jenna     NaN      NaN         15   NaN
Jon        21        4          1  12.5

Solution 2:

Alternative - using iloc (can also use loc here):

df['avg'] = df.iloc[:,0:2].mean(axis=1)

Solution 3:

Resurrecting this Question because all previous answers currently print a Warning.

In most cases, use assign():

df = df.assign(avg=df.mean(axis=1))

For specific columns, one can input them by name:

df = df.assign(avg=df.loc[:, ["Monday", "Tuesday", "Wednesday"]].mean(axis=1))

Or by index, using one more than the last desired index as it is not inclusive:

df = df.assign(avg=df.iloc[:,0:3]].mean(axis=1))

How does $2^{(\log_4{x})}$ become $\sqrt[2]{x}$?

find the value of the product of roots of this quadratic equation

Can $X - Y A^\dagger Y^T\succ0$ be written as an LMI where $A^\dagger$ is a pseudoinverse?

Need clarification on implicit differentiation.

Explanation of terminal objects using limits

For a bounded linear operator T, if $||Tx_o||<\epsilon$, then what can we say the norm of $Tx$ for $x \in$ the epsilon ball around $x_o$

Prove that there exists an $x' \in [a,b]$ such that $f(x') = \frac{\int^b_a fg}{\int^b_a g}$ for the given conditions.

finding out if two vectors are perpendicular or parallel

Proving there exists $k$ such that $p(n)p(n+1)=p(k)$

Let $f:A \to \Bbb R^n$ be measurable. Show that $\{x \in \Bbb R^n \mid m(f^{-1}\{x\}) > 0\}$ is a countable set.

Counting ten-digit numbers whose digits are all different and that are divisible by $11111$

convergence of a series with logarithm

Row-wise average for a subset of columns with missing values

Solution 1:

Solution 2:

Solution 3:

Related

Recent Posts