I have an invoice dataset where the currency (invoice_currency) comes in two values - CAD and USD. Hence, the column total_open_amount has values with different units, although unknown to the column itself. How should I do EDA on this considering I cannot find skewness in a column that has different units? How may I create two more columns on the basis of currency? How will it affect my ML prediction?
2.1m questions
2.1m answers
60 comments
57.0k users