Tutorials – Win Vector LLC

A Deep Dive on The Drunkard’s Walk

November 26, 2023, 8:54 am

I continue my series on the mathematics of Markov chains with a deep dive on The Drunkard’s Walk. This is a set-up for more on Wald’s Sequential Analysis (a near relative of A/B Tests). A great thing...

The Biased Drunkard’s Walk

December 4, 2023, 4:31 pm

Our “Markov Chains leading up to A/B tests” series continues with The Biased Drunkard’s Walk. In this note we use the theory of Toeplitz matrices to analyze a variant of the drunkard’s walk that I am...

Conditioning on the Future

December 27, 2023, 11:42 am

In both A Slightly Unfair Game and The Drunkard’s Walk In Detail we showed a fair random walk that moved up or down with 50/50 probability. Some of these walks stopped when they were absorbed at zero,...
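A fair walk of this kind can be sketched in a few lines of Python. This is only an illustrative simulation, not code from the articles: the upper barrier `k`, the function names, the trial count, and the seed are all assumptions made for the example. It estimates how often a walk started at `start` reaches `k` before being absorbed at zero; for a fair walk the classical gambler's-ruin result puts that probability at `start / k`.

```python
import random


def run_fair_walk(start, k, rng):
    """Step +1 or -1 with equal probability until the walk reaches 0 or k.

    Both barriers are absorbing; the upper barrier k is an assumption
    made for this sketch.
    """
    position = start
    while 0 < position < k:
        position += 1 if rng.random() < 0.5 else -1
    return position


def estimate_top_absorption(start, k, n_walks=5000, seed=2023):
    """Estimate the fraction of walks absorbed at k rather than at 0."""
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    hits = sum(run_fair_walk(start, k, rng) == k for _ in range(n_walks))
    return hits / n_walks


if __name__ == "__main__":
    # Theory for a fair walk: P(reach k before 0) = start / k.
    print(estimate_top_absorption(start=3, k=10))
```

With `start=3` and `k=10` the estimate should land near 0.3, up to sampling noise.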

What You Should Know About Linear Markov Chains

January 1, 2024, 11:17 am

I want to collect some “great things to know about linear Markov chains.” For this note we are working with a Markov chain on states that are the integers 0 through k (k > 0). A Markov chain is an...

Tools for Jupyter in (and near) Production

January 28, 2024, 5:35 pm

I am sharing a tutorial video showing “run Jupyter in production” tools (including the ability to remove the Jupyter dependency). The point is: how to let the analyst work in Jupyter and without great...

Use Jupyter Notebooks Inside For-Loops

February 6, 2024, 12:09 pm

In my opinion, a number of “moving data science to production” problems would be solved if one could just use a Jupyter notebook inside a for-loop. The wvpy package supplies the tools put...

What Good is Analysis of Variance?

February 28, 2024, 1:11 pm

I’d like to demonstrate what “analysis of variance” (often abbreviated as “anova” or “aov”) does for you as a data scientist or analyst. After reading this note you should be able to...

Illustrating the F-test in Action

March 4, 2024, 3:30 pm

I have a new note showing how the F-test works. The F-test is a good way to quantify model effectiveness, and I think it pairs nicely with my earlier ANOVA article. Please check it out.

How Data Quantity Drives Model Quality

April 2, 2024, 11:31 am

I’d like to share a video introduction to a new article on training set size. I am trying to explain some of the subtleties of evaluating “in sample” (on data used during the model inference procedure)...

The m = n Machine Learning Anomaly

April 3, 2024, 10:59 am

In our note “How Data Quantity Drives Model Quality” we worked on how the training data size controls model quality in linear regression. At that time, to avoid some true horror, we deliberately...
