This post describes our migration journey of Spark from EMR on EC2 to EMR on EKS.
A few months ago we released Hamilton, Stitch Fix’s open-source microframework for managing dataflows. With feedback from the community, we’ve implemented a variety of new features that make it more general purpose for Data Science/Data Engineering teams. Importantly, Hamilton now operates over any python object types, and can integrate with a variety of distributed compute platforms.
This post is a re-posting of an article first published at oreilly.com.
This post showcases what a Platform and DS team can accomplish collaborating at Stitch Fix.
Good for the business can also be good for the world. We’re proud and humbled to share our work using inventory algorithms to support entrepreneurs of color.
See who's out in the wild for the month of September.
Query based recommendations via representation learning