DataScience – Page 2

Everything you need to know about matrix in machine learning (I): Solve Ax = b

Written by Ju on March 31st, 2019 (updated April 3rd, 2019).

In machine learning, we often deal with high-dimensional data. For convenience, we usually represent the data as matrices. Numerical optimization in machine learning often involves matrix transformation and computation. To make matrix computation more efficient, we often factorize a matrix into several special matrices, such as triangular matrices and orthogonal matrices. In this post, I will review essential matrix concepts used in machine learning.
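
As a rough illustration of why triangular factors are useful, the sketch below solves Ax = b by Gaussian elimination with partial pivoting, which implicitly factorizes A into triangular pieces (PA = LU), and then finishes with back substitution on the upper triangular system. The function name `solve_linear_system` and the 2×2 example are assumptions made for illustration and are not taken from the post; in practice one would normally call a library routine instead.

```python
def solve_linear_system(A, b):
    """Solve Ax = b for a square matrix A given as a list of rows.

    Hypothetical helper for illustration: Gaussian elimination with
    partial pivoting, followed by back substitution.
    """
    n = len(A)
    # Work on float copies so the caller's data is left untouched.
    U = [[float(v) for v in row] for row in A]
    y = [float(v) for v in b]

    for k in range(n):
        # Partial pivoting: use the row with the largest entry in column k.
        p = max(range(k, n), key=lambda i: abs(U[i][k]))
        if U[p][k] == 0.0:
            raise ValueError("matrix is singular")
        U[k], U[p] = U[p], U[k]
        y[k], y[p] = y[p], y[k]

        # Eliminate the entries below the pivot. Applying the same row
        # operations to y carries out the forward substitution step.
        for i in range(k + 1, n):
            m = U[i][k] / U[k][k]
            for j in range(k, n):
                U[i][j] -= m * U[k][j]
            y[i] -= m * y[k]

    # Back substitution on the upper triangular system U x = y.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(U[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (y[i] - s) / U[i][i]
    return x


# Example: 2x + y = 3 and x + 3y = 5.
solution = solve_linear_system([[2, 1], [1, 3]], [3, 5])
```

Once the matrix is triangular, each unknown is recovered with a single pass over one row, which is the main reason factorizations of this kind make repeated solves cheap.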

Numerical optimization in machine learning (I): the basics

Written by Ju on March 22nd, 2019 (updated April 28th, 2019).

In logistic regression, computing the optimal model parameters requires an iterative numerical optimization approach (Newton's method or gradient descent) rather than a simple analytical solution. Numerical optimization is a crucial mathematical concept in machine learning and function fitting, and it is deeply integrated into model training, regularization, support vector machines, neural networks, and so on. In the next few posts, I will summarize key concepts and approaches in numerical optimization and their applications in machine learning.
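
To make the idea of an iterative approach concrete, here is a minimal sketch of batch gradient descent for a one-feature logistic regression. The toy data, the learning rate, the step count, and the names `fit_logistic` and `log_loss` are all assumptions chosen for illustration; they do not come from the post, and Newton's method would use second-derivative information instead of a fixed step size.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def log_loss(w, b, xs, ys):
    """Average negative log-likelihood of the labels under the model."""
    total = 0.0
    for x, y in zip(xs, ys):
        p = sigmoid(w * x + b)
        total -= y * math.log(p) + (1 - y) * math.log(1 - p)
    return total / len(xs)


def fit_logistic(xs, ys, lr=0.1, steps=2000):
    """Hypothetical helper: fit weight w and bias b by gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Gradient of the average log loss with respect to w and b.
        errors = [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]
        grad_w = sum(e * x for e, x in zip(errors, xs)) / n
        grad_b = sum(errors) / n
        # Step against the gradient to reduce the loss.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Toy data (assumed): larger x values are more likely to be labeled 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
ys = [0, 0, 1, 0, 1, 1]

initial_loss = log_loss(0.0, 0.0, xs, ys)
w, b = fit_logistic(xs, ys)
final_loss = log_loss(w, b, xs, ys)
```

There is no closed-form solution to read off here: the parameters are found by repeatedly evaluating the gradient and taking a small step, and the loss after fitting should be lower than the loss at the starting point.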

Goal setting for function fitting (classification)

Written by Ju on March 5th, 2019 (updated March 31st, 2019). 2 Comments

In the previous post, I discussed loss functions in regression. In this post, I will elaborate on how we develop loss functions in classification, where the output is discrete rather than continuous.

Do you write production code as a data scientist?

Written by Ju on February 21st, 2019 (updated March 31st, 2019).

In the past month, I posed this question to my friends, peers, and online tech forums, and got responses from more than 30 data scientists across various industries, with different academic backgrounds and career paths. The responses show a wide spectrum of data scientists’ involvement in production and reveal some shared concerns about career development.

Goal setting for function fitting (regression)

Written by Ju on February 18th, 2019 (updated May 30th, 2019).

Whenever we see the word “optimization”, the first question to ask is “what is to be optimized?” Defining an optimization goal that is both meaningful and attainable is the starting point of function fitting. In this post, I will discuss goal setting for function fitting in regression.

Restrict function search space by assumptions

Written by Ju on February 10th, 2019 (updated March 31st, 2019).

In the previous post, I discussed the function fitting view of supervised learning. It is theoretically impossible to find the best fitting function from an infinite search space. In this post, I will discuss how we can restrict the search space in function fitting with assumptions.

Set up the supervised learning problem as function fitting

Written by Ju on February 5th, 2019 (updated March 31st, 2019).

In this very first post of the Connect the Dots series, I set up the supervised learning problem from a function fitting perspective and discuss the objective of function fitting.

Connect the Dots

Written by Ju on February 4th, 2019 (updated May 23rd, 2019). 3 Comments

Linear Regression — https://xkcd.com/1725/

Entering 2019, I plan to start a post series discussing what I have learned in statistics, machine learning, big data, computer science, and neuroscience (always!). I named this series “Connect the Dots”, after the puzzle game “connect the dots”.

Ju Yang

Ph.D. / Machine Learning Practitioner in New York