Towards a statistical theory of data selection under weak supervision

Germain Kolossov, Andrea Montanari, Pulkit Tandon

International Conference on Learning Representations 2024 (ICLR 2024) Conference

Given a sample of size $N$, it is often useful to select a subsample of smaller size $n