Jul 2018
23 Mon
24 Tue
25 Wed
26 Thu 07:45 AM – 06:15 PM IST
27 Fri 07:45 AM – 05:35 PM IST
28 Sat
29 Sun
Jul 2018
23 Mon
24 Tue
25 Wed
26 Thu 07:45 AM – 06:15 PM IST
27 Fri 07:45 AM – 05:35 PM IST
28 Sat
29 Sun
Chris Stucchio
Everyone has used linear regression. It’s boring, standard mathematics that we learned in Stats 101.
But how many of us really understand it at a deep level? One of the “rules” of linear regression is that your features must not exhibit multicollinearity. But where does this rule come from? What happens if we violate it? Many people suggest regularization or ridge regression as a solution, but why do these methods work? What are we actually doing?
In this talk I’ll discuss linear regression from the Bayesian perspective. This is a simple way to think about it which makes the answer to these questions quite transparent. It also provides an avenue to solve various harder problems (e.g. non-gaussian errors) that you might not have seen before.
As a running example I’ll consider predicting scores in fantasy sports, specifically the scores of a batter in Baseball or Cricket.
End goal of this talk: if you have highly correlated input data, non-normal errors, domain knowledge exceeding input data, or other common problems, you shouldn’t get stuck. You might need to custom hack some tools
Familiarity with linear regression and some basic mathematics (calculus, linear algebra) is helpful. If you’re familiar with Bayesian reasoning, so much the better (but not required).
Chris Stucchio is a former physicist, high frequency trader and software developer. He’s currently the head of data science at Simpl. He’s been working in decision theory and bayesian optimization for the past 5 years, and has been teaching statistics to novices for much longer.
Jul 2018
23 Mon
24 Tue
25 Wed
26 Thu 07:45 AM – 06:15 PM IST
27 Fri 07:45 AM – 05:35 PM IST
28 Sat
29 Sun
Hosted by
{{ gettext('Login to leave a comment') }}
{{ gettext('Post a comment…') }}{{ errorMsg }}
{{ gettext('No comments posted yet') }}