Weekly roundup of everything happening in the MLOpsphere

Well, I haven't done that in a while, and now I remember why. Three different pieces of content to indulge in from the past week, and man, is each one straight quality!

Buckle Up, It's Flyte Time!
Ketan Umare, the creator of Flyte, the open-source project that came out of Lyft's engineering squad, talked with us about what the production-ready system does and doesn't do. Side note: Ketan is such an awesome guy and super humble, and I'm so glad to have talked with him. Check out the Flyte website here to learn more, and drop them a star on GitHub!

Check out our Coffee Sessions #12: Flyte: an open-source tool for scalable, extensible, and portable workflows. Podcast and video
Double Dask!
The two creators of Dask sat down with David and me to talk shop about what exactly Dask is and when to use it. I also convinced Matt to be the guest on our meetup this week, so we effectively have a double Dask week! Check out the video / podcast of our first chat here, and be prepared for a lot more of those insights in round 2! Here is a quick overview of what we will talk about on Wednesday:

Python makes data science and machine learning accessible to millions of people around the world. However, historically Python hasn't handled parallel computing well, which leads to issues as researchers try to tackle problems on increasingly large datasets. Dask is an open source Python library that extends the existing Python data science stack (NumPy, pandas, scikit-learn, Jupyter, ...) with parallel and distributed computing. Today Dask has been broadly adopted by most major Python libraries, and is maintained by a robust open source community across the world.

This talk will discuss parallel computing generally, Dask's approach to parallelizing an existing ecosystem of software, and some of the challenges we've seen in deploying distributed systems.

Finally, we'll also address the challenges of robustly deploying distributed systems, which ends up being one of the main accessibility challenges for users today. We hope that by the end of the meetup attendees will better understand parallel computing, have built intuition around how Dask works, and have the opportunity to play with their own Dask cluster on the cloud.
What Are These Data Scientists Actually Doing?
Elizebeth Chabot drove home some absolutely incredible points about the reality of ML at today's small to medium-sized startups. She had so many great stories from the different experiences she has had over the years.

Lesson 1. Collect data.
Lesson 2. Collect the right data.
Lesson 3. Watch the video.
Put ML In Prod With SageMaker
Community member and former meetup guest Neylson Crepalde has become the first person to write a guest post on our MLOps Medium page. Check out his article, which should help inspire you to get your model into production with Amazon SageMaker! We plan to start sourcing more content from all the experts in the community, so if you have something interesting you want to write about, feel free to reach out!

Eating Up The Food Chain
Community member Mark Peters shared what seems to be a pretty rocking event in the #be-shameless channel on Slack, and I couldn't help but repost. (PS: lots of good stuff happening in that channel.)

It's a free virtual event from the DevOps Institute about the ML problems in the DevOps chain. Looks like it will be quite the event, and I might even have to pirate some of the material! You know what they say about great artists!

 
Best of Slack
Have a great week! Check out our Slack, YouTube, and podcasts if you haven't already. Also, it would mean a lot to me if you filled out this form so I can learn more about the community.


