May 2020 – Ju Yang

Distributed model training II: Parameter Server and AllReduce

Written by Ju on May 20th, 2020September 21st, 2020. Leave a comment

In the previous post, I talked about using MapReduce and Spark for distributed model training. In this post, I will talk about parameter server and how it is used in distributed model training.

(more…)

Distributed model training I: MapReduce and Spark

Written by Ju on May 7th, 2020September 21st, 2020. 1 Comment

In the previous post, I introduced challenges in machine learning systems with big data and complex models. In this post, I will discuss distributed systems in the era of big data.

(more…)

Think big: ML systems in the era of big data

Written by Ju on May 1st, 2020September 11th, 2023. Leave a comment

Let’s start with linear regression. Using established libraries such as scikit-learn, it is almost trivial to train a linear regression model. We can easily run the model training with a few hundred Megabytes of data on our laptop with a build-in CPU.

Now let’s think big.

(more…)

Ju Yang

Ph.D. / Machine Learning Practitioner in New York

Posts from May 2020

Distributed model training II: Parameter Server and AllReduce

Distributed model training I: MapReduce and Spark

Think big: ML systems in the era of big data