Imbalance, Stacking, Timing, and Multicore

Imbalance: see weights used by svm.SVC()'s class_weight='balanced'

Imbalance: oversampling, undersampling

Stacking

First check classifiers individually.

Try stacking the first three, using the fourth as the final estimator.

See if we can understand what the StackingClassifier() did.

Algorithm Efficiency and Timing Code

If you do not know the run-time of your algorithm, you can time it for several values to get some insight as to how the time relates to the input size.

e.g. Here we time sorting arrays of random numbers for each of several sample sizes and the make a plot to see the relationship between run time and sample size.

It would be easy to make a regression model, or just estimate the model by eye, and then predict the run time for a particular N. (I am recommending extrapolating here as better than nothing, even though extrapolation is typically risky.)

Multicore computing