It looks like there is only about an hour of data in the data set. At this time scale, encoding a timestamp doesn’t help you. It looks like the data set is artificial, seeing that it has an even distribution of transactions (exactly one per second). Since there is a regular interval to the data entries (artificial or not), you should be fine just removing the timestamp_timeOfDay
field altogether (make it None
in the params).
Also, I see you are using a ScalarEncoder
for your binary isFraud
value. I would suggest this advice instead, as I believe it more evenly distributes bits and assures no overlap between values.