How to convert a dictionary into a pandas dataframe with key and values in two separate columns?

Solution 1:

You can use df.explode:

import pandas as pd

d = {
    1: {1, 2, 3},
    2: {4, 5}
}

df = pd.DataFrame(d.items(), columns=['Source', 'Target'])
df = df.explode('Target')

Which gives

   Source Target
0       1      1
0       1      2
0       1      3
1       2      4
1       2      5

Here, we create the dataframe with multiple values for each Target, and explode then creates a new row for each value in target.

Notice that the index still reflects the original dataframe, so we can use:

df = df.reset_index(drop=True)

To reset it to

   Source Target
0       1      1
1       1      2
2       1      3
3       2      4
4       2      5

Which combined gives us

df = df.explode('Target').reset_index(drop=True)

Solution 2:

You could create the DataFrame from each key:value pair in the dictionary and then concat them together.

import pandas as pd

pd.concat([pd.DataFrame({'Source': k, 'Target': tuple(v)}) for k,v in d.items()],
          ignore_index=True)

Or, you can use the pd.DataFrame.from_dict constructor, and stack, with a bunch of renaming

(pd.DataFrame.from_dict(d, orient='index')
   .stack()
   .reset_index(-1, drop=True)
   .rename('Target').rename_axis(index='Source')
   .reset_index()
   .astype(int))

   Source  Target
0       1       1
1       1       2
2       1       3
3       2       4
4       2       5

How to convert a dictionary into a pandas dataframe with key and values in two separate columns?

Solution 1:

Solution 2:

Related

Recent Posts