Currently in the gradientbooser fit method, all of the data is used for determining cuts for binning the data. It would like speed things up, if we allowed for a sample to be used for the initial bin. This could be a parameter such as initial_bin_sample_size or something like that.