| Another week, another amazing podcast! Slater Victoroff, the CTO and founder of Indico Data, joined us to discuss ETL for unstructured data, multi modal machine learning, and his perspective on how to data programming is real future of ML engineering (and how Indico makes this happen!)
This was both a theoretical and practical conversation. Slater has been involved with ML since the AlexNet days. He shared his wisdom from
along the way, especially on how to thinking fundamentally about data and information flows in machine learning. We discussed some of the opportunities and flaws of synthetic data and active learning. Both techniques can make models a lot better, but the architecture of the data systems that enable these techniques is really crucial to realize their potential.
On the practical side, Slater walked us through how Indico deals with the ETL challenges of representing unstructured data. We jammed on how data engineering is causing more of the challenges in machine learning than modeling. Representing data in a flexible fashion is crucial to Indico's unconventional modeling solutions, which takes modeling out of only the data scientist's hands and puts subject matter experts in a position to make and deploy their own models. Sound fascinating? It really is, and we got into the nitty gritty of how Indico does this with a sample use case.
Thanks to Slater
for joining us and being such a thought-provoking guest! Definitely listen to this session to get some knowledge dropped.
Till next time, Vishnu
|