In case you are still interested in this problem: I'd say the most "fair" way to do it might be to randomly sample 10-20% from each of the year splits and hold that out.
Possibly better, if you are someone from the competitive debate community, would be to randomly sample that same 10-20% but at the level of each individual file. This would prevent the random sampling from favoring certain files/annotators over others and would hopefully maximize the diversity of the samples.
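For what it's worth, here is a rough sketch of what that per-group sampling could look like in Python. It assumes the data has been loaded as a list of dicts; the `"file"` and `"year"` field names, the `split_per_group` helper, and the toy records are all made up for illustration, so adjust them to however the dataset is actually stored.

```python
import random
from collections import defaultdict


def split_per_group(records, group_key, holdout_frac=0.1, seed=0):
    """Randomly hold out `holdout_frac` of the records within each group.

    `group_key` is the field to stratify on, e.g. "year" or "file"
    (hypothetical field names, not taken from the dataset itself).
    """
    rng = random.Random(seed)  # fixed seed so the split is reproducible

    # Bucket the records by the value of the grouping field.
    groups = defaultdict(list)
    for rec in records:
        groups[rec[group_key]].append(rec)

    kept, held_out = [], []
    for key in sorted(groups):  # sorted for a deterministic group order
        items = list(groups[key])
        rng.shuffle(items)
        n_hold = round(len(items) * holdout_frac)
        held_out.extend(items[:n_hold])
        kept.extend(items[n_hold:])
    return kept, held_out


# Toy data: 3 hypothetical files with 10 records each.
records = [
    {"file": f"file_{i}", "year": 2013 + i % 2, "id": j}
    for i in range(3)
    for j in range(10)
]

# Carve out 20% of every file as test, then 25% of the rest as dev.
rest, test = split_per_group(records, "file", holdout_frac=0.2, seed=13)
train, dev = split_per_group(rest, "file", holdout_frac=0.25, seed=13)
```

Passing `"year"` instead of `"file"` as `group_key` gives the per-year version of the same idea. Note that very small groups can round down to zero held-out records, so you may want to handle those separately.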
Hi!
How should I split the data into train/dev/test sets?