Access to data increasingly massive have changed significantly the way to represent the health and the life. In this context, we need to develop statistical model which are suitable both for biological problem and large data set problem. In particular, we are interested in statistical model used during association studies between environment and genetics.