La diferencia de dos columnas en el marco de datos de pandas en Python se lleva a cabo utilizando los siguientes métodos:
Método #1: Usando el operador ”-” .
import pandas as pd # Create a DataFrame df1 = { 'Name':['George','Andrea','micheal', 'maggie','Ravi','Xien','Jalpa'], 'score1':[62,47,55,74,32,77,86], 'score2':[45,78,44,89,66,49,72]} df1 = pd.DataFrame(df1,columns= ['Name','score1','score2']) print("Given Dataframe :\n", df1) # getting Difference df1['Score_diff'] = df1['score1'] - df1['score2'] print("\nDifference of score1 and score2 :\n", df1)
Producción:
Given Dataframe : Name score1 score2 0 George 62 45 1 Andrea 47 78 2 micheal 55 44 3 maggie 74 89 4 Ravi 32 66 5 Xien 77 49 6 Jalpa 86 72 Difference of score1 and score2 : Name score1 score2 Score_diff 0 George 62 45 17 1 Andrea 47 78 -31 2 micheal 55 44 11 3 maggie 74 89 -15 4 Ravi 32 66 -34 5 Xien 77 49 28 6 Jalpa 86 72 14
Método n.º 2: usar el método sub() del marco de datos.
import pandas as pd # Create a DataFrame df1 = { 'Name':['George','Andrea','micheal', 'maggie','Ravi','Xien','Jalpa'], 'score1':[62,47,55,74,32,77,86], 'score2':[45,78,44,89,66,49,72]} df1 = pd.DataFrame(df1,columns= ['Name','score1','score2']) print("Given Dataframe :\n", df1) df1['Score_diff'] = df1['score1'].sub(df1['score2'], axis = 0) print("\nDifference of score1 and score2 :\n", df1)
Producción:
Given Dataframe : Name score1 score2 0 George 62 45 1 Andrea 47 78 2 micheal 55 44 3 maggie 74 89 4 Ravi 32 66 5 Xien 77 49 6 Jalpa 86 72 Difference of score1 and score2 : Name score1 score2 Score_diff 0 George 62 45 17 1 Andrea 47 78 -31 2 micheal 55 44 11 3 maggie 74 89 -15 4 Ravi 32 66 -34 5 Xien 77 49 28 6 Jalpa 86 72 14