Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
1.0k views
in Technique[技术] by (71.8m points)

pandas - Column contains column 3

I have a dataframe. I would like to test whether, (C), on each row, the number in column (B) is in the string, column (A).

df = pd.DataFrame({'A': ["me 123", "me-123", "1234", "me 12", "123 me"],
                   'B': [123,       123,      123,    123,     6]})

I can do that using extract

df['C'] = df.A.str.extract('(d+)', expand=False).astype(int).eq(df.B,0).astype(int)

         A    B  C
0   me 123  123  1
1   me-123  123  1
2     1234  123  0
3    me 12  123  0
4  123  me    6  0

However, if one of the A values does not contain a number:

df = pd.DataFrame({'A': ["me 123", "me-123", "1234", "me 12", "123 me", "me"],
                   'B': [123,       123,      123,    123,     6,        123]})

Then I get:

ValueError: cannot convert float NaN to integer
See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Answer

0 votes
by (71.8m points)

Values NaNs are floats, so you can convert output to floats:

df['C'] = df.A.str.extract('(d+)', expand=False).astype(float).eq(df.B,0).astype(int)

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...