Share
Preview
Weekly roundup of all things MLOps.
 ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌

This just might be our biggest week yet! A quick overview of everything going on this week:

And...all the good stuff that happened last week:

ML Monitoring
For our first of 2 meetups on Tuesday this week at 5pm UK/ 9am PT we will be talking to Lina Weichbrodt about how to monitor ML stacks.

Monitoring usually focusses on the “four golden signals”: latency, errors, traffic, and saturation. Machine learning services can suffer from special types of problems that are hard to detect with these signals. The talk will introduce these problems with practical examples and suggests additional metrics that can be used to detect them. A case study demonstrates how these new metrics work for the recommendation stacks at Zalando, one of Europe’s largest fashion retailers.

By the way, we are breaking new ground at MLOps.community as she will be the first female guest on the meetup! I know it has taken entirely too long for this and I'm sorry 😅

Full Stack Data Scientist?  
Is there such a thing? Are they the rare unicorns that only exist in the place that I go to right before I fall asleep?

On Wednesday we will have Linkedin influencer and lead data scientist Alexey Grigorev share his thoughts on the subject.


We all know what we need to do to be good data scientists: know machine learning, be able to program, be fluent in SQL and Python. That’s enough to do our job quite well.

But what does it take to be a better data scientist?

The best way to grow as a data scientist is to step out of direct responsibilities and try on the hats of a product manager as well as a DevOps engineer.

In particular, we should:

- be pragmatic and product-oriented
- communicate more
- get into infrastructure

After attending this chat, you will know exactly how we can do it.

Workshop - Python and Dask 
Another first for the community... on Thursday this week at 5pm UK/ 9am PT, Dan Gerlanc will give us a hands-on workshop around calling the DataFrame with Python and Dask.

Python's most popular data science libraries—pandas, numpy, and scikit-learn—were designed to run on a single computer, and in some cases, using a single processor. Whether this computer is a laptop or a server with 96 cores, your compute and memory are constrained by the size of the biggest computer you have access to.

In this workshop, you'll learn how to use Dask, a Python library for parallel and distributed computing, to bypass this constraint by scaling our compute and memory across multiple cores. Dask provides integrations with Python libraries like pandas, numpy, and scikit-learn so you can scale your computations without having to learn completely new libraries or significantly refactoring your code.
Feature Stores + Gift Cards
If you are at all interested in learning about feature stores I highly recommend checking out our last meetup with the CTO of Tecton Kevin Stumpf. here is the video link and also the podcast link.

The good folks over at Tecton.ai said they wanted to do something for the community so if you want to test out tecton.ai when you sign up put that you are part of the MLOps community when you request early access and you will get entered into a raffle to win a 50$ amazon gift card! whoohoo!
Coffee Sessions #2
For round 2 of our coffee session series, it was the usual crew of David and I and we were also joined by the incredible Byron Allen to talk about 3 different ways of serving models. We commented heavily on this in-depth blog post around model serving. Check out the video here!

Let us know what you want the next coffee sessions to be about Feature Stores or Airflow? And as always if you want to join us in one of these you are invited!
Tool Talk: Paperspace
Community member David Aponte sat down with Misha Kutsovsky from Paperspace to learn a bit more about what MLOps should do and the technical role of MLOps in the machine learning pipeline.

They cover defining sensible primitives, abstracting away compute infrastructure, managing shared experiments from the CLI, collaborating via pull requests in shared repos -- and much more. Check out the video here and the podcast version here.
Meeting Adjourned
In a week full of firsts, this one really excites me! We had an MLOps board meeting to help decide on the future vision for the community. Some topics we discussed were ModelOps Labs, Git repo knowledge hub, sponsorships, and what success means for the community.

I want to open up the board meetings to everyone that feels part of this community so if you were looking to get more involved let me know and I'll add you to the Slack channel!
Best of Slack
Have a great week and feel free to reply back if you just need someone to talk to during confinement.



Email Marketing by ActiveCampaign