Text this: Unbalanced data processing using oversampling: machine Learning