How to load randomly sampled k% from train set?
You can .shuffle() the dataset and then .select(range(int(percentage * len(ds))))
.shuffle()
.select(range(int(percentage * len(ds))))