In this Computomics Podcast episode, we launch the new “Make Sense of Science” explainer series, a format designed to break down complex scientific and technical topics into clear, accessible conversations.
In this first episode, Computomics machine learning scientist Alaukik Saxena introduces deep learning for genomic prediction in plant breeding. He explains how breeders use DNA marker data, field data, and environmental information to predict important plant traits such as yield, flowering time, or plant height, often before new plants are even grown. The episode also looks at why this is such a challenging task: breeding data is complex, environments differ greatly, and models can easily give overly optimistic results if they are not tested carefully.
Alaukik also explains the difference between classical statistical models, machine learning, and deep learning, and where each approach has its strengths. A key message of the episode is that good models depend on good data. Better phenotyping, smarter experimental design, and realistic validation are just as important as the algorithms themselves.
More:
|
|
Alaukik studied Mechanical Engineering at Panjab University and Materials Science and Simulations in Masters at Ruhr University Bochum. He acquired his PhD at Max Planck Institute for Sustainable Materials in Düsseldorf. His PhD work focused on using machine learning to automate and improve the analysis of atom probe tomography (APT) data, making sense of complex 3D datasets in a reliable and scalable way. He joined Computomics in September 2025 as a Machine Learning Scientist. He’s currently developing advanced machine-learning models for genomic prediction, modeling G×E×M effects. In practice, he integrates genome-wide marker data (e.g. SNPs), pedigree relationships, and high-dimensional environmental covariates to predict plant phenotypes more reliably across diverse environments. |
Share on