Due to technological advances, large and high-dimensional data have become the rule rather than the exception. Methods that allow for feature selection with such data are thus highly sought after, in particular, since standard methods, such as cross-validated lasso and group-lasso, can be challenging both computationally and mathematically. In this paper, we propose a novel approach to feature selection and group feature selection in linear regression. It consists of simple optimization steps and tests, which makes it computationally more efficient than standard approaches and suitable even for very large data sets. Moreover, it satisfies sharp guarantees for estimation and feature selection in terms of oracle inequalities. We thus expect that our contribution can help to leverage the increasing volume of data in Biology, Public Health, Astronomy, Economics, and other fields.