Random Problems

Visualizing the 3n+1 Problem

2024-01-12T22:10:00.000-08:00

You might have heard of the 3n+1 problem at some point. It is pretty cool to visualize.

Problem

The problem basically states that if you do the following:

take any positive integer
if it's even, divide it by 2
if it's odd, multiply it by 3 and add 1 to it
repeat this and stop once you've gotten to 1

you will eventually stop at 1. Take 7 for example:

7 is odd so next number is 3(7) + 1, or 22
22 is even so next number is 22/2, or 11
11 is odd so next number is 3(11) + 1, or 34
34 is even so next number is 34/2, or 17
17 is odd so next number is 3(17) + 1, or 52
52 is even so next number is 52/2, or 26
26 is even so next number is 26/2, or 13
13 is odd so next number is 3(13) + 1, or 40
40 is even so next number is 40/2, or 20
20 is even so next number is 20/2, or 10
10 is even so next number is 10/2, or 5
5 is odd so next number is 3(5) + 1, or 16
16 is even so next number is 16/2, or 8
8 is even so next number is 8/2, or 4
4 is even so next number is 4/2, or 2
2 is even so next number is 2/2, or 1
1 is the stopping point

You can see that it bounces around a bit. What does that bouncing look like? Here are the paths for the numbers 2 to 31 (watch until at least 27):

Pretty cool that 27 seems to explode out of nowhere. How does the iterations required to get to 1 change based on the starting value?

And finally, what's the max value you get from each starting value?

What Were the Most Dominant Dynasties in College Football?

2023-03-04T21:57:00.004-08:00

It's hard to get a sense of how much Nick Saban's Alabama teams are dominating compared with Bobby Bowden's FSU ones.

To try to visualize it, I created a score to represent a program's momentum. The algorithm works like this for each season:

1st place adds 6 pts
2nd place adds 3 pts
3rd place adds 2 pts
4th place adds 1 pts
Lose one point if you finished outside the top-4 two years straight
Increment the points you lose by 1 for each year past two for the previous bullet
Reset points lost increment if you finish in the top-4

I used the AP poll for all rankings because it exists and is easily mineable for all years in this post. As an example, consider this table of score vs year and position:

year	position	score
2000	10th	0
2001	1st	6
2002	10th	6
2003	1st	11
2004	10th	11
2005	10th	10
2006	10th	8
2007	3rd	10
2008	10th	10
2009	6th	9
2010	10th	7
2011	8th	4
2012	10th	0

The general idea is that you grow momentum by consistently finishing near the top, championships boost it by a lot, and you need sustained seasons outside the top to lose momentum.

One interesting way to view this is geographically. In the plot below, each circle's size is proportional to the score of the team located at the center of the circle:

You clearly see giant circles appear for famous dynasties like Bowden at Florida State and Saban at Alabama. Another interesting takeaway from this view is how much dominance has shifted to a small geographic area in the southeast and Ohio.

Focusing in on top dynasties, here are those plotted over time:

Pretty cool. From this metric at least, Saban's current run at Alabama is historic.

Overall, I'm surprised at how non-dominant Nebraska and Penn State were. I viewed their reigns as similar to Miami's, and they aren't really near that level from this metric at least.

Mario Party's 'Hide and Sneak' is not Balanced

2022-07-01T21:28:00.005-07:00

Are you consistently losing to a 6 year old and looking for an excuse. If so, you've come to the right place.

'Hide and Sneak' is one of the mini-games in Mario Party. The basic rules are that 3 people hide and 1 person finds them. In round 1, there are 4 hiding spots. In round 2, 3. In round 3, 2.

Each player on the hiding team picks a spot to hide and the seeking player picks one spot. If a player is hiding in the picked spot, he's out.

If a player is out he doesn't participate in later rounds.

If any players remain on the hiding team after 3 rounds, the hiding team wins.

Is this game 50/50? Working through the math, start with 1 hider:

round 1, there's a 3/4 chance of not being found
round 2, there's a 2/3 chance of not being found
round 3, there's a 1/2 chance of not being found

Thus, for an individual player on the hiding team, there's a (3/4) * (2/3) * (1/2) chance of not being found. That's 1/4, or 25%.

Inverting that, the seeker has a 3/4, or 75% chance of finding a given player after 3 rounds.

Since there are 3 players and their hiding decisions are independent, the chance of the seeker finding all 3 players in 3 rounds is just (3/4)^3, or 27/64. That's only 42%. It isn't balanced at all.

Is there an obvious way to balance it? What if we did 4 rounds with 5 starting hiding spots. For one hider:

round 1, there's a 4/5 chance of not being found
round 2, there's a 3/4 chance of not being found
round 3, there's a 2/3 chance of not being found
round 4, there's a 1/2 chance of not being found

Multiplied out, that's a 1/5, or 20% chance of not being found so 4/5, or 80% success chance for the seeker. The chance that the seeker finds all 3 players in 4 rounds then is (4/5)^3, or 51%. 4 rounds is way more balanced than 3 rounds.

A Simple Dice Game

2022-05-30T20:20:00.005-07:00

My son asked about a seemingly simple dice game and I didn't know how to answer immediately. The question is:

Player 1 rolls a 6-sided die. Player 2 rolls a 6-sided die. If player 2 rolls the same as player 1, he rolls again. If he again rolls the same, he loses. How often will each player win?

It seems obvious that player 1 will win since rolling a 6 guarantees a win for him and getting the same thing twice guarantees a loss for player 2. The atual math was interesting to reason through. We can simply go through each possibility.

Player 1 rolls a 1
- Player 2 wins with anything else (5/6). If he rolls a 1 (1/6 chance), roll again with a 5/6 chance of winning
- Player 2 win chance is 5/6 + (1/6)*(5/6) = 35/36
Player 1 rolls a 2
- Player 2 wins with a 3 or up (4/6). If he rolls a 1 (1/6 chance), roll again with a 4/6 chance of winning
- Player 2 win chance is 4/6 + (1/6)*(4/6) = 28/36
Player 1 rolls a 3
- Player 2 wins with a 4 or up (3/6). If he rolls a 1 (1/6 chance), roll again with a 3/6 chance of winning
- Player 2 win chance is 3/6 + (1/6)*(3/6) = 21/36
Player 1 rolls a 4
- Player 2 wins with a 5 or up (2/6). If he rolls a 1 (1/6 chance), roll again with a 2/6 chance of winning
- Player 2 win chance is 2/6 + (1/6)*(2/6) = 14/36
Player 1 rolls a 5
- Player 2 wins with a 6 (1/6). If he rolls a 1 (1/6 chance), roll again with a 1/6 chance of winning
- Player 2 win chance is 1/6 + (1/6)*(1/6) = 7/36
Player 1 rolls a 6; he wins automatically

To get Player 2's win chance then, we just sum his chance for all possibilities and divide by the number of possibilities (6). That's (1/6)*(35/36 + 28/36 + 21/36 + 14/36 + 7/36) = 105/216 = 48.6%.

Thus, Player 1 wins 100 - 48.6, or 51.4% of the time.

What if we change the rule slightly? Try this game:

Player 1 rolls a 6-sided die. Player 2 rolls a 6-sided die. If player 2 rolls the same as or less than player 1, he rolls again. If he again rolls the same, he loses. If he rolls higher in the second roll, he wins. How often will each player win?

I'll just do player 1 rolls a 2 as an example then present the answer since the logic is so similar.

If Player 1 rolls a 2, Player 2 will win with a 3 or above so 4/6. He will reroll for a 2 or below so 2/6 of the time, and his chance of winning that second roll is 4/6. Thus, if Player 1 rolls a 3, Player 2 wins 4/6 + (2/6)*(4/6), or 32/36.

Summing all paths up again, Player 1 wins this game 42.2% of the time.

To try to make things a bit fairer, consider one more game:

Player 1 rolls a 6-sided die. Player 2 rolls a 6-sided die. If player 2 rolls the same as or less than player 1, he rolls again and subtracts 1 from his roll. If he again scores the same, he loses. If he scores higher in the second role, he wins. How often will each player win?

Going through the same logic, this is the fairest of these games. Player 1 wins 49.0% of the time.

Simple Tutorial for Hosting a CRUD Node App on AWS Elastic Beanstalk

2021-11-10T23:10:00.008-08:00

I had trouble finding (working) simple tutorials for running Node.js CRUD apps on AWS using Elastic Beanstalk so I wrote one from scratch and documented it.

The app

I've put two phases together. The first is a simple hello world using Node.js, and the second is a simple app that lets you add a number to a MySQL database and read back all numbers that you've added. Below is what it looks like when it's working:

Please do not use this as a reference for writing Node cleanly or anything like that. This is structured to (hopefully) be very minimal and easy to follow example for setting up Node.js and MySQL through this service and isn't intended to be a serious app.

Setting up your AWS environment

Set up an environment using the AWS documentation. When writing this, the steps were:

create an environment

choose 'Web server environment'

configure environment

give it a name
choose node js for type
use defaults for node and linux
choose 'upload your code' at the bottom
upload a zipped folder with the code from one of the examples I listed above; it should look like this (i.e., files directly in the zipped folder and not a folder containing them)

configure more options

create a database and choose mysql

create

wait 10 minutes or so

These might change though so the AWS documentation is likely a good place to start for initial environment creation.

When complete, it should look something like this:

Quick exploration of the environment

Two things that you might want to do right away are manage versions of your app, and look at logs.

To manage versions, simply go to 'Application versions' on the left and you can deploy, delete, etc. any of them:

To look at logs, simply go to 'Logs' on the left and you can pull them.

Quick overview of the hello world app

package file:

The package file for this is very simple. It just defines express v3.1.0 as a dependency and tells the environment that 'node app.js' is the starting script to run.

app:

Things worth noting here:

AWS manages the ports for you; 'const port = process.env.PORT || 3000;' says 'use port 3000 unless the host has configured a port for you'

'const dir = `${__dirname}/public/`;' grabs the public folder for the app and puts it in dir

'app.get("/", (req, res) => { res.sendfile(dir + "index.html"); });' serves up the index.html file in the public folder when the default url is visited

That's it.

What the CRUD app adds to the hello world example

package file:

This app needs to use mysql and there's a convenient async package for smoothing out async usage in node that I used here.

app:

There are a few new things here:

database configuration

AWS manages the db so generate a connection using the environment variables for it
in order

create the db if it doesn't exist
set it as the db to use
create the 'numbers' table if it doesn't exist

log error or success message depending on how that sequence of calls went

route configuration

the app has two endpoints

history: return all numbers entered so far
new: add a new number

route:

The new endpoints are just SQL queries and those are implemented in 'numbers.js' in the routes folder.

index.html

This is a basic numeric input and button. Page load gets the history of numbers entered. Clicking the button adds the number in the input then gets the history.

Basic summary then is:

navigating to the page returns index.html

index.html auto-calls the history and displays a formatted response

calling history actually calls the 'getNumbers' method in numbers.js

submitting a new number calls new and then history

calling new actually calls the 'addNumber' method in numbers.js

Summary

That's all that's required to host a simple Node.js + MySQL app on AWS Elastic Beanstalk. Hopefully nothing has changed significantly by the time you read this that breaks this tutorial as that happening to a couple of others I read through is what prompted me to write this. Feel free to comment if you hit issues.

How Do You Subtract Binary Numbers?

2021-10-24T21:43:00.003-07:00

What is the value of something like 101001 - 1101?

It's probably easiest to understand this by first going through subtraction of normal (base 10) numbers. What is 119 - 35? You do the following:

start with 1's digit; 9 - 5 = 4
move to 10's digit; 1 - 3 = -2; negative numbers make this hard, so 'borrow' 10 from the 100's digit; now the 10's digit is 11, so 11 - 3 = 8
100's digit lost 1 in the top number in the borrowing, so it's now 0; bottom number's 100's digit is 0 also

So you have 0 in the 100's place, 8 in the 10's, and 4 in the 1's, so 84 is the result.

Subtracting binary numbers works identically, except instead of borrowing 10 and using powers of 10 (1's, 10's, 100's, etc.) as places, you use powers of 2 (1's, 2's, 4's, etc.).

Starting with an easy one: 10 - 1 (in binary):

start with 1's digit; 0 - 1 = -1; negative numbers make this hard, so 'borrow' 2 from the 2's digit; now the 1's digit is 2 - 1 = 1
move to 2's digit; it lost 1 in the borrowing, so it's now 0; bottom number's 100's digit is 0 also

So you have 0 in the 2's place, and 1 in the 1's place, so the answer is 1. Double-checking, 10 in binary is 2, and 1 is 1, so 10 - 1 in binary is the same as 2 - 1 in normal (base 10) which is obviously 1.

That's it. It works the exact same way as base 10 subtraction except that instead of borrowing 10, you borrow 2.

Now for the original problem: 101001 - 1101

1's digit is 1 - 1 = 0
2's is 0 - 0 = 0
4's is 0 - 1; borrow from the 8's to get 2 - 1 = 1
8's had the borrow on the top, so it's now 0 - 1; borrow from the 16's to get 2 - 1 = 1
16's had the borrow on the top, so now it's -1 - 0; borrow from the 32's to get 1 - 0 = 1
32's had the borrow on the top, so now it's 0 - 0 = 0

So the result is just 11100. Converting that to base 10, that's 28. Checking by converting the original problem, that was 41 - 13, and the answer is also 28.

How Do I Determine My Raise Given Inflation?

2021-10-14T22:23:00.009-07:00

If you get a 10% raise, and inflation is 6%, did you actually get a raise?

To get it out of the way, your actual raise is given by:

\[{\text{actual raise = }\frac{\text{new salary}}{\text{old salary * (1 + inflation rate)}}} - 1\]

Where does this come from? It's maybe easiest to think of this in terms of units. Say you make $50,000 now, and you made $40,000 last year. You make 20% more right? Not exactly. What the $ there really represents is some purchasing power. Inflation is a drop in purchasing power, so what you really need to do is convert the $ before and after to the same unit. To determine the value of $ in the current year in terms of the $ in the previous year, you just divide it by 1 + inflation rate. That gives you the equation above.

Plugging in the numbers in the initial question then, the actual raise is:

\[{\frac{\text{new salary}}{\text{old salary * (1 + inflation rate)}}} - 1\]

\[{\frac{\text{old salary * (1 + 0.10)}}{\text{old salary * (1 + 0.06)}}} - 1\]

Which is just 0.038, so the actual raise is 3.8%.

It is very important to understand your raise in terms of local inflation. If you get a 5% raise but your area gets 10% more expensive, you actually got a paycut (4.5% paycut given those numbers).

Can You Confirm Performance Improvements With Noisy Software Benchmarks?

2021-10-02T22:32:00.003-07:00

Say you run 20 tests before and after a code change meant to speed up the code, but there's a lot of noise in your benchmarks. Some simple statistical tests can help you determine if you actually have an improvement in that noise.

Sample Data

Imagine your 20 runs before and after look like this:

Before (ms)	After (ms)
241	272
224	211
202	226
243	234
246	205
229	279
209	208
231	212
258	218
287	198
270	215
262	244
227	215
200	175
291	220
290	218
184	218
319	247
250	245
229	199

In case you prefer histograms:

The 'after' numbers look like they're maybe smaller. If you take the average you get 245 ms before and 223 ms after. Is that really better though or are you just seeing noise?

T-Test

Assuming your benchmarking noise is roughly normally distributed, you can use a T-Test. If you have never seen a T-test, a really rough description is that it will take two groups of numbers, and tell you if the means of the two groups are significantly different (i.e., the difference between them probably isn't just noise).

What does 'probably' mean here? You get a p value out of T-Tests that is the probability that they're the same. E.g., a p value of 0.05 would mean roughly 'there's a 5% chance that the ~20 ms difference here is just noise'.

You can do this in excel, google sheets, any of the many websites that do it, etc. I tend to use Python for this sort of stuff so a simple overview of how to do it in Python is:

import stats from scipy
call the ttest_ind method in it with the before numbers as the first arg and the after numbers as the second
the t value returned should be positive (since before should be higher than after) and the p value should be 2*target probability

For the numbers in the example here, I get a p value of 0.03 which is less than the common target of 0.05, and recall earlier that I noted it's 2*target probability, so this is effectively a probability of 1.5% (p value of 0.015) which would generally mean 'significant difference'. Note that 'significant' here doesn't mean important...just unlikely to be noise. The difference in means is still the primary metric here.

To summarize this then, you could say that the update significantly altered the benchmark time, and the difference in means is ~20 ms (or a ~10% performance improvement).

Why divide by 2?

This is an artefact of the method you use. In this case, the method I gave for testing this tests both sides of the assumption (i.e., tests both before > after and before < after). We only care about the before > after side though. This method actually handles this for you in current versions but I have an older version installed and wanted to put the more generic.

Why ttest_ind?

There are a lot of variants of T-Tests you can run. It's worth reading through them but I won't rewrite tons of info on them here. The ttest_ind I used is for independent samples of data. You might argue that a paired one is better here since 'making code faster' is sort of changing one aspect of a thing and testing it again, but ttest_ind works well in general usage.

Mann-Whitney

What if you have outliers and/or do not have a normal distribution of noise in your benchmarks? For a concrete example, what if the first number in the 'after' column is 600 instead of 272? T-Tests are not valid in these situations. Running it blindly returns a p of 0.4 which would indicate not significantly different, all from that single bad outlier.

You can auto-exclude best and worst n times. You can manually scrub data. That sounds really manual though and we want to automate things. You can also use another type of test. One that's useful here is the Mann-Whitney U test.

The results are similar to a T-Test but the test itself is looking for something slightly different. Roughly, this test tells you how likely it is that the results are such that a random value chosen from after is just as likely to be greater than a random value chosen from before as vice-versa. Since it doesn't care about the magnitudes (only the orders), it is fine for outliers and non-normally distributed data.

Same basic flow in Python:

import stats from scipy
call the mannwhitneyu method in it with the before numbers as the first arg and the after numbers as the second; also pass in 'two-sided' as the alternative to be consistent with the T-Test above if you want
the p value should be 2*target probability

With the numbers here, I get a p value of 0.04, so dividing by 2, 0.02. This test was not tripped up by the outlier.

Why Do We Multiply the Way We Do?

2021-09-13T22:18:00.014-07:00

We could just repeatedly add the numbers but we don't. Is the algorithm we use actually faster?

What I'm talking about here is multiplying 219 x 87 like the following:

7x9 to get 63
7x1 to get 7 and add a 0 to get 70
7x2 to get 14 and add two 0's to get 1400
8x9 to get 72 and add a 0 to get 720
8x1 to get 8 and add two 0's to get 800
8x2 to get 16 and add three 0's to get 16000
add all those together to get 19,053

That's 6 simple multiplications and 6 additions. If we just added 219 to itself 87 times, that's 87 operations so clearly more steps with one big assumption:

you've memorized m x n for all integers m and n from 2 to 10.

This is why we all had to learn times tables. How does this generalize as an algorithm?

Repeated addition is ~ operation per 'smallest of the two numbers', so that is just O(n) where n is the smaller of the numbers.

The algorithm we actually use is a bit harder. It scales as a x b where a and b are the number of digits in the two numbers. How does 'number of digits' scale? That's O(logn) where n is the number. Since it scales as the product of those, that algorithm scales as O(logm * logn) where m and n are the two numbers.

What about the memorized simple multiplications? I have no idea how our memory access scales, but I'm going to just guess it's a constant time operation for simple multiplication so O(1) which doesn't contribute.

For an example with actual calculations, here is the cost of multiplying each number up to 99 by 99 using each algorithm:

It might not be obvious that O(logm * logn) is faster than O(n) but with actual numbers in the plot there it becomes pretty clear.

It's cool to me that a basic math thing we all learn when we're little kids effectively uses a dynamic programming algorithm (memorize all m x n for m and n up to 10; convert every multiplication problem into a combination of m x n problems that you already solved).

Exploring Senior Software Engineer Salary Data in levels.fyi

2021-08-31T21:43:00.000-07:00

levels.fyi is a great resource for software salary info and it's easily mineable. I was curious how salaries in what are sometimes considered medium cost-of-living cities compare.

Software careers often have levels (hence the site name). Typically there's entry with 0-2.5 years, next at 2-5 years, then career level at 5-10 years. Some go above that (principal, chief, etc.). The one I'll play with here is the 5-10 year one. 5-10 year is often called 'senior software engineer'.

Here are the rough pay distributions in levels for that experience range in mid-priced cities (this is total compensation and not base salary):

A comparison is hard because it's not clear that each city represents the same thing. For example, many are state capitals so if 90% of the jobs are state ones then you'd expect them to be lower. Here's a list of the top-3 included employers for each city in that plot to hopefully provide more context:

Pittsburgh: Google, Uber, Argo AI
Chicago: Paypal, Expedia, Accenture
Denver: Amazon, Deloitte, Gusto
Austin: Apple, IBM, Amazon
Detroit: Amazon, General Motors, Quicken Loans
Atlanta: VMWare, Salesforce, Microsoft
Raleigh: IBM, Cisco, Microsoft
Nashville: Amazon, Asurion, HCA Healthcare
Phoenix: American Express, Intel, Amazon

Amazon is everywhere apparently...

These numbers aren't perfect obviously. Many people do work for the state for example and they don't seem to be providing salaries here, so I'd wager that levels.fyi is biased towards higher-paying companies. Fun data though.

How to Add a Vertical Scrollbar to Plotly

2021-08-01T21:26:00.002-07:00

Plotly doesn't have the built-in ability to scroll vertically with a fixed x axis unfortunately, but you can mimic that fairly easily...

First, here's the demo:

See the Pen vertical scroll plotly by Robert Hamner (@rhamner) on CodePen.

The basic model here is two stack two plots directly on top of each other where top is in a scrollable div and bottom is not.

Make two divs

plot div

scrollable
width = plot width + scroll width

xaxis div

not scrollable
width = plot width

Make two plots

plot

goes in plot div
y-axis zeroline is hidden
bottom margin is 0

xaxis

0 top margin
hide the modebar

Make the plot xaxis ranges equal

You can then get as complicated as you need to here. I added really crude layout event linking to the demo...I'm hitting a weird double-click bug (should autoscale but isn't) but this works pretty easily/cleanly as the basic concept.

If 10 Vaccinated and 10 Unvaccinated People Die, Can We Still Say Vaccines Work?

2021-06-30T21:52:00.004-07:00

You will almost certainly be seeing headlines about vaccinated people dying and might even see that more vaccinated than unvaccinated die. Here's one from the week that I wrote this post. Why do we still say vaccines work if this is happening?

Imagine as an example that you see '10 vaccinated and 10 unvaccinated doctors died from COVID-19 today'. Your brain probably thinks 'well...the vaccine didn't work I guess.' We see those numbers, then just assume that the populations were similar. They're all doctors right?

Digging more, say that it turns out that 90% of the doctors were vaccinated. To make it easy, assume that there are 1,000 total doctors. 90% vaccinated means there were 900 vaccinated and 100 unvaccinated. If 10 died from each group, that means:

10 / 100, or 10% of unvaccinated doctors died
10 / 900, or 1.1% of vaccinated doctors died

Unvaccinated doctors were 9 times as likely to die as vaccinated ones. Another way of phrasing that is that the vaccine's efficacy was:

vaccine efficacy = 1 - (vaccinated risk/unvaccinated risk) = 1 - (0.011/0.1) = 89%

This is how you have to think about things like this. Vaccines, masks, seat belts, helmets, etc. aren't 100% effective. Use the calculation above whenever you see headlines like this and want to know the actual story.

You can even have more vaccinated deaths than unvaccinated. Imagine for the 89% efficacy vaccine above, you have 99% of the population vaccinated. For 10,000 doctors in that example, you'd expect to have 10% of the 100 unvaccinated die and 1.1% of the vaccinated 9900 die, so that's 10 unvaccinated deaths and 120 vaccinated deaths. A highly effective vaccine can still have more vaccinated people die than unvaccinated ones.

In case a visual helps, here is the initial example's distribution as a colored grid (red = dead and green = alive):

Wheel Options Strategy Simulations

2021-06-13T21:56:00.005-07:00

The 'Wheel' is an options strategy that combines cash-secured puts with covered calls. I sometimes have trouble really grasping options strategies in my head, so simulating some scenarios gives me a better feel.

Basic strategy

To keep it simple, I will just deal with 'at the money' (ATM) options here. The basic strategy then is:

Sell a cash-secured put to start
If the stock goes up, the put expires so you sell another put
If the stock goes down, the put is exercised, you're assigned the shares, so you can sell a covered call
If the stock goes down from there, the call expires so you sell another call
If the stock goes up from there, the call is exercised, you sell the shares, so you sell another put
repeat...

You can see that when you are assigned shares, you just sell a call, and when a call is exercised, you just use the cash to sell a put. This repeats indefinitely. Isn't this just free money? Sort of...what you're trading off here is a bit hard to see immediately. This is where playing with some numbers can make it easier to understand what's happening.

Simple examples

To get a better idea of how this works, let's look at 3 simple examples:

stock doesn't change much
stock drops ~15% in a year
stock gains ~15% in a year
stock crashes ~15% and rebounds in a year

In each scenario, I'll add a bit of noise and assume that selling a put yields $1.25/month, selling a call yields $1/month, and these are all monthly expirations and ATM strikes with a starting value of $100.

Imagine in the first one the price for the first 5 months is 100, 102, 101, 97, 101. What does the wheel strategy look like?

sell put for $1.25 with a $100 strike; gain $1.25 from the sell and lose nothing in stock

price hits $102; that's above the $100 strike so it expires; sell another $1.25 put with a $102 strike and lose nothing in stock

price hits $101; that's below the $102 strike so pay $102 for the shares and sell a $1 call with a $101 strike

price hits $97; that's below the $101 strike so it expires; sell another $1 call with a $97 strike

price hits $101; that's above the $97 strike so sell at $97 and sell another $1.25 put with a $101 strike

Overall, the wheel earned $5.75 from selling options, but lost $4 in the stock (bought at $101 and sold at $97). That stock loss is the most obvious loss here but there's another more subtle one. Look at the first put again. The gain from selling the option was $1.25, but the stock itself gained $2 then. The gain was effectively capped at $1.25. The same is not true for the loss. When the stock fell, the entire loss was absorbed. Capping gains while having to absorb losses is a primary tradeoff here (some other ones are poor tax performance, low-liquidity, and potentially missing dividends).

Now that that's understood, it's helpful to me to see this graphically, so here are sample runs of the examples from above:

The general behavior here is that the wheel smooths out the plots a bit. Increases and decreases aren't quite as big. You can control how smooth it is by changing expiration dates and strike price offsets (e.g., selling calls with a strike price 10% above current price will allow for larger gains but give you less option premium, so the performance looks more like buy and hold). When a stock crashes, you'll probably do a bit better with the wheel. When a stock surges, you'll probably do a bit worse with the wheel.

The above plots are single trials of the simulation. What does it look like if this is run thousands of times?

That's much clearer to me. In down periods, the wheel just minimizes your losses a bit (loss is stock loss, but you gain option premium). In good periods, the wheel caps your gains so you get a flattened distribution (max gain is the option premium).

Summary

Should you use this strategy? There's no perfect answer for that. In an extremely long bull market (like current), it's likely going to underperform. It does give you a bit of protection against drops and can do better in neutral markets. I personally don't like the thought of capping gains while not capping losses so this isn't a favorite of mine (see the fourth image with the crash and rebound to understand why that can be bad), but it's definitely viable if you want to smooth out your returns a bit.

Negative values with a log axis in Plotly

2021-05-21T20:15:00.004-07:00

Although log10(<any number less than or equal to 0>) is not defined, there are situations where you want to visualize data as if it were. How can you get plotly to do that? Another way of asking is 'how can you mimic symlog functionality in plotly?'

First...a real example of when you'd want this. Imagine you do the following:

generate a 1 GHz tone
measure amplitude at +/- 10 kHz, +/- 100 kHz, +/- 1 MHz, ...
generate a 2 GHz tone
measure amplitude at +/- 10 kHz, +/- 100 kHz, +/- 1 MHz, ...
want to overlay those offset amplitude curves

You could just plot vs absolute frequency to see one, but to overlay you need to center around a tone, and it just makes sense to show 'offset from tone' as the x axis. However, those steps imply a log scale.

Below is a working example of exactly this situation in plotly.js. I've included the ideal here with both positive and negative on a log scale, and the normal linear plot so that the difference in parsing it quickly is obvious:

See the Pen symlog approximation by Robert Hamner (@rhamner) on CodePen.

The basic algorithm is pretty simple:

Determine the max and min values and the value closest to zero; largest of max and abs(min) is upper bound...value closest to zero is lower
Split all traces into positive and negative (x values here since I just did this for x in the demo)
Create two x-axes: one for positive and one for negative

give both the same bounds
reverse the negative x-axis
assign ticks with positive values but negative labels to the negative x-axis
put a small buffer between them to represent that zero is undefined

Plot positive traces vs positive x-axis and negative traces vs negative x-axis, but make the negative x values positive

In that demo above you can just step through the javascript code and it should all be pretty clear.

If you want a slight variant of this that matches 'symlog' in matplotlib, just add a third, linear axis to connect these two instead of leaving a gap. I personally prefer the gap for this situation.

Simple way to see code coverage in python

2021-05-02T21:45:00.008-07:00

Sometimes you want to quickly see unit test coverage of your code. Coverage.py makes that really simple.

First, what do I mean by test coverage? Below is an example for a really simple usage:

That tells me how much of the code is executed when I executed my tests (start with test_* here). For that example, I just have two files with two methods in each. The files are identical and look like this:

I have one file with unit tests. I'm using pytest for this but coverage works with other test frameworks. Here is the unit test file:

With a file this simple, we can see some obvious test gaps like:

it doesn't test all functions in all files
it doesn't test all paths in the functions (e.g., no 0 test in file1)

You can imagine how hard it is to see that for any realistic code though. This is where coverage checks can help. Clicking into the coverage for file2 here we get:

Really simple to see that we missed the non-zero case in is_zero and that we didn't call the is_zero_wrapper function at all in any of the tested paths.

It is important to note that 100% coverage doesn't mean your code is perfectly tested and that less than 100% coverage doesn't mean your codebase is garbage. This is just one of many useful metrics for gauging test coverage and testing gaps.

To set this up and run it:

install coverage (e.g., 'pip install coverage')
install pytest (e.g., 'pip install pytest')
run 'coverage run -m pytest'
run 'coverage html'
open the index.html file in the htmlcov folder that it created

That index.html file is what I have in the screenshot at the start.

If you want to use my exact code to test this, it's available here.

Thinking in terms of probabilities

2021-04-06T21:11:00.009-07:00

We suck at probability. A common trap we fall into is failing to realize this and thinking in terms of absolutes.

What's an example of this? One I've encountered many times is something like 'if you work hard you can stop being poor' or 'everyone decides their own wealth'. Is this true? This is what I mean...there is no absolute yes or no answer. Consider the following plot (source data):

The following statements are all true:

some kids from the poorest households end up wealthy
some kids from the wealthiest households end up poor
most poor kids stay poor
most rich kids stay rich
parental wealth is a good predictor of your wealth as an adult

Many people will see someone claim that last bullet and jump to 'what about this guy that grew up poor and made it?' The plot clearly shows that's possible and doesn't negate the last bullet. Thinking in terms of what's likely is a better model for this.

Another common example is the classic 'it's cold today so global warming isn't real'. If you don't think of the distributions of temperatures, this is an easy fallacy to fall victim to. Here are two plots of temperature distributions for Denver, Colorado summer highs in 1900 and 2000 (respectively):

There are clearly cold (for summer) days in both. There is also a clear shift towards higher temperatures in 2000 vs 1900. That 'the distribution has shifted towards higher temperatures' is the best mental model for global warming in my opinion. If you want to see more of these I pulled them from this page.

This could go on forever but I hope the general idea is clear. Many things are distribution-based and can be understood much more easily if thought of in terms of 'how does this distribution shift/compare?'

If you're interested in a great book on this general topic, I liked 'The Drunkard's Walk'.

If the square root of -1 is i, what is the cube root of -1?

2021-02-07T22:11:00.002-08:00

You probably learned at some point that the square root of -1 is i. What about the cubed root of it? There's the obvious answer of (-1)^3 = -1, but the answer isn't actually that simple.

To answer this, we'll need Euler's identity which is:

\[e^{i\pi}=-1\]

Just take the cubed root of each side:

\[e^{i\pi*\frac{1}{3}}=-1^{\frac{1}{3}}\]

\[e^{\frac{i\pi}{3}}=-1^{\frac{1}{3}}\]

Now we just need the following definition:

\[e^{ix}=cos(x)+i*sin(x)\]

Plugging in our value:

\[e^{\frac{i\pi}{3}}=-1^{\frac{1}{3}}\]

\[cos(\frac{\pi}{3}) + i*sin(\frac{\pi}{3})=-1^{\frac{1}{3}}\]

\[\frac{1}{2} + i*\frac{\sqrt{3}}{2}=-1^{\frac{1}{3}}\]

And that's it...there's another cube root of -1.

What does that actually mean? Consider this coordinate system:

With real numbers on the horizontal axis and imaginary numbers on the vertical axis, you can draw complex numbers as vectors. This has a cool property. We got pi/3 radians as our angle there. That's equal to 60 degrees, or one-sixth of a full rotation. Looking at that coordinate system, if r = 1:

0 degrees = 1
90 degrees = i
180 degrees = -1
270 degrees = -i
360 degrees = 1
450 degrees = i
...

It rotates around. Since an angle of pi/3 represents 60 degrees, cubing the value with r = 1 and angle = 60 degrees gives you the same thing as r = 1 and angle = 180 degrees, which is -1.

Thinking through it a bit more, that's not unique. What if we used 300 degrees instead? Rotating by 300 degrees 3 times gives you 900 degrees which is just 2 revolutions + 180 degrees. Will that give you -1 also?

60 degree answer cubed:

\[(\frac{1}{2} + i*\frac{\sqrt{3}}{2})*(\frac{1}{2} + i*\frac{\sqrt{3}}{2})*(\frac{1}{2} + i*\frac{\sqrt{3}}{2})\]

\[(\frac{1}{4} + i*\frac{\sqrt{3}}{2} - \frac{3}{4})*(\frac{1}{2} + i*\frac{\sqrt{3}}{2})\]

\[\frac{1}{8} + i*\frac{\sqrt{3}}{8} + i*\frac{\sqrt{3}}{4} - \frac{3}{4} - \frac{3}{8} - i*\frac{3*\sqrt{3}}{8}\]

That adds up to -1 which is what we wanted.

300 degree answer cubed:

\[(\frac{1}{2} - i*\frac{\sqrt{3}}{2})*(\frac{1}{2} - i*\frac{\sqrt{3}}{2})*(\frac{1}{2} - i*\frac{\sqrt{3}}{2})\]

\[(\frac{1}{4} - i*\frac{\sqrt{3}}{2} + \frac{3}{4})*(\frac{1}{2} + i*\frac{\sqrt{3}}{2})\]

\[\frac{1}{8} - i*\frac{\sqrt{3}}{8} - i*\frac{\sqrt{3}}{4} + \frac{3}{4} + \frac{3}{8} + i*\frac{3*\sqrt{3}}{8}\]

That also adds up to -1 which is what we wanted. Finally, we have the -1^3 = -1 answer which is just the 180 degree one.

Thus, we found three cubed roots of -1: 0.5 + 0.866i, 0.5 - 0.866i, and -1.

For the one we all learned...'square root of -1 is i'...is that really the only answer? Doing a similar exercise, you want to end up at m*360 + 180 degrees after n rotations where n is the root and m is an integer. Here, n = 2. That means 2*rotation = m*360 + 180, or rotation = 180*m + 90. Start with m = 0. rotation = 90 which means i is an answer which we know. Try m = 1. rotation = 270 which means -i is answer. Trying that out...-i * -i = i^2 = -1. That works. Try m = 2. rotation = 450 which is just 90 + 1 full cycle, so we're repeating now. i and -i are our square roots of -1.

Regression Toward the Mean in the NFL

2021-01-24T22:49:00.009-08:00

I wanted to run some quick tests to see if regression toward the mean shows up clearly in NFL data.

Background

In case you aren't familiar, 'regression toward the mean' roughly means that if a random variable is an outlier, a future instance is likely to be closer to the mean. For a really simple model to make this easy to understand for something like NFL player performance, imagine that each player's performance is X% skill and Y% luck. If X is 100 and Y is 0, then previous years will nearly perfectly predict future years. If Y is 100 and X is 0, then there will be no relationship between performance from one year to the next. If X and Y are both between 0 and 100, there will be some relationship between performance from year to year but it won't be perfect.

There are two easy ways for me to look at this phenomenon:

plot one year's performance against the previous year's along with a line with a slope of 1 (X = 100%) and a best-fitting line
bin the data by previous year's performance and look at how each bin shifted in the next year

What might we see? There are many possibilities, but here are a few examples:

"Players that performed well perform even better the next season": plot 1 will show a slope greater than 1 and plot 2 will show the bottom bin doing worse and the top bin doing better
"Performance is driven by skill so it's the same year-to-year": plot 1 will show a slope of 1 and plot 2 will show all bins at roughly zero
"Performance is a mix of skill and luck so top performers will move back towards average and poor performers will move up towards average (this is the regression toward the mean case)": plot 1 will show a slope between 0 and 1, and plot 2 will show the bottom bin doing better and the top bin doing worse
"It's all random/luck": plot 1 will show a slope of ~0 and plot 2 will show all bins at roughly 0
"Poor performers overcompensate and end up better than average next season": plot 1 will show a slope less than 1 and plot 2 will show the bottom bin doing better and the top bin doing worse

To test it out I ran with 5 different stats using data from all starters from 2000-2020. For example, for a 2010-2011 compare, year 1 is 2010 and year 2 is 2011. You would expect the best performers in 2010 to do a bit worse in 2011, and the worst in 2010 to do a bit better in 2011. In the bar plots, the 'bottom third' means the 33% of players that were worst in season 1 from the plot above.

Results

and the data show regression toward the mean. Every stat I've tried (with a luck component obviously) followed the pattern above.

Fourier Series Animations

2021-01-15T22:00:00.003-08:00

It always seemed magical to me that you can get a square wave from adding together sine waves, so I threw together some animations of Fourier series.

Square Wave

Pulse

Parabola

Pulse Variation

How Should You Bet on a Biased Coin Toss?

2021-01-02T22:12:00.003-08:00

If you know a coin is biased to come up heads 75% of the time, what betting strategy should you use to bet on the outcome of a flip?

What might seem intuitive is to have a mixed strategy of 75% heads and 25% tails. Maybe something like 'flip an unbiased coin twice and bet tails if you get tails twice and heads for any other outcome'. What result will that give you?

There are four possibilities here:

Coin lands on heads and you bet heads (75%*75% = 56.25% of the time)
Coin lands on heads and you bet tails (75%*25% = 18.75% of the time)
Coin lands on tails and you bet heads (25%*75% = 18.75% of the time)
Coin lands on tails and you bet tails (25%*25% = 6.25% of the time)

1 and 4 are winning situations, so you'll win 62.5% of the time this way (just sum the 1 and 4 win rates).

You might immediately notice that 62.5% is less than 75%. What if you just always bet heads? Filling out the same list as above:

Coin lands on heads and you bet heads (75%*100% = 75% of the time)
Coin lands on heads and you bet tails (75%*0% = 0% of the time)
Coin lands on tails and you bet heads (25%*100% = 25% of the time)
Coin lands on tails and you bet tails (25%*0% = 0% of the time)

1 and 4 are winning situations, so you'll win 75% of the time this way. In this situation, the general win rate is:

(coin bias*head bet percentage) + [(1 - coin bias)*(1 - head bet percentage)] = win rate

We want to maximize this. Using b for 'coin bias' and h for 'heat bet percentage':

b*h + (1 - b)*(1 - h) = win rate

b*h + 1 - h - b + b*h = win rate

2*b*h + 1 - h - b = win rate

h*(2*b - 1) + 1 - b = win rate

At this point, we have an equation for a line. Win rate vs h is a line with a slope of 2*b - 1 and an x-intercept of 1 - b. Anytime 2*b - 1 is positive, this line will go up and to the right so h = 1 is the best bet (heads 100% of the time). Anytime 2*b - 1 is negative, h = 0 is the best bet (tails 100% of the time). 2*b - 1 is positive whenever b is greater than 0.5. Thus, the optimal strategy here is bet in the direction of the bias 100% of the time when you have a known, biased coin.

Making a CSS Flashlight Effect Using Conic-gradients

2020-12-27T22:02:00.004-08:00

This is just a quick tutorial of conic-gradients showing a flashlight effect with very little code.

The basic idea here is to use a conic-gradient and do the following:

set it to be the flashlight color and fairly transparent for the bright area (yellow from -25 to 25 degrees in the example here)
set it to be dark and fairly opaque for the dark area (black with 95% opacity from 25 to 335 degrees in the example here)
make the flashlight layer(s) fixed position and sit on top of the page
to keep it from starting as a point, offset it (vertical location of 110% in the example here puts it 10% below the bottom of the page)

And that's it...it's actually really simple. Here is a working example on top of a dummy html page:

See the Pen Flashlight by Robert Hamner (@rhamner) on CodePen.

It's really clean and requires no javascript. It's probably possible to make it cleaner. An obvious question you might have is 'can I make a flashlight that moves with the mouse?' and the answer is sure...simply set the gradient position to the cursor location (this requires javascript but is simple):

See the Pen Flashlight mouse by Robert Hamner (@rhamner) on CodePen.

All that took was adding a listener to the page for mouse or touch movements, updating --X and --Y variables on those events, and setting the conic-gradient position to be var(--X) var(--Y). Simple and looks pretty cool.

What Are the Most Impressive NFL Combine Performances Ever?

2020-12-11T22:03:00.011-08:00

If you combine the major tests and adjust for weight and height, which NFL player had the most impressive combine performance?

Data

Unfortunately, the modern combine hasn't existed for that long, so I only have data back to the year 2000. Still, that gives us a good-sized data set (~5000 players with at least some data).

To try to find 'best performance ever', I wanted to do two things:

Adjust for weight and height...a 200 lb guy running a 4.5 forty is way less impressive than a 250 lb guy doing it.
Try to combine all metrics...a 200 lb guy running a 4.5 forty and getting 4 bench reps is way less impressive than a 200 lb running a 4.5 forty and getting 24 bench reps.

The metrics that seem to be available for most people are:

40 yard dash time
bench press reps (number of times they bench press 225 lbs)
broad jump
vertical jump

So I used those.

Calculation

To calculate this, I used a three step process:

Perform linear regression for each metric using weight and height as inputs ('metric = C1*weight + C2*height + C3').
Divide actual value by value predicted from the regression for each metric to get a score. E.g., if a player ran a 4.5 40 and the model predicted a 4.7 one for his weight and height, he'd get 4.5/4.7, or 0.957 for that metric.
Calculate an overall score that's a weighted rss of the individual scores. The weights are 1, 1/5, 1/2, 1/2 for the four metrics in that order.

It doesn't affect the calculation much, but throughout, I use weight as an input for everything but bench reps, and weight^2/3 as an input for bench reps.

Results

Using the calculation described above, these are the greatest combine performances (actual value to the left and predicted value in parentheses to the right):

player	40 time (s)	bench (reps)	broad (inches)	vertical (inches)
Vernon Davis	4.38 (4.82)	33 (21)	128 (114)	42 (33)
Terna Nande	4.51 (4.70)	41 (20)	124 (115)	39 (34)
Vic Beasley	4.53 (4.78)	35 (20)	130 (115)	41 (33)
Mario Williams	4.70 (5.06)	35 (23)	120 (109)	40 (30)
Cornelius Washington	4.55 (4.89)	36 (22)	128 (112)	39 (32)
Myles Garrett	4.64 (4.93)	33 (23)	128 (111)	41 (31)
Nick Perry	4.55 (4.92)	35 (23)	124 (111)	38 (31)
Margus Hunt	4.62 (4.95)	38 (21)	121 (113)	34 (32)
D.K. Metcalf	4.33 (4.67)	27 (18)	134 (118)	40 (34)
Jerick McKinnon	4.41 (4.57)	32 (19)	132 (117)	40 (35)
Davis Tull	4.57 (4.78)	26 (21)	132 (114)	42 (33)
Jon Alston	4.50 (4.65)	30 (19)	132 (118)	40 (34)
Vernon Gholston	4.65 (4.89)	37 (23)	125 (112)	36 (32)
Sean Weatherspoon	4.62 (4.74)	34 (21)	123 (115)	40 (33)
Demario Davis	4.49 (4.71)	32 (19)	124 (116)	38 (34)
Scott Young	5.08 (5.15)	43 (27)	115 (104)	35 (29)
Michael Johnson	4.61 (4.89)	28 (20)	128 (114)	38 (32)
Alex Barnes	4.59 (4.66)	34 (20)	126 (117)	38 (34)
Benjamin Watson	4.50 (4.85)	34 (22)	123 (113)	36 (32)
Virgil Green	4.54 (4.79)	23 (20)	130 (115)	42 (33)

#1 there did not surprise me. Vernon Davis's 40 time is pretty well known as an insane combine performance.

The first really odd one in that list is actually #2, Terna Nande. He had an extremely short NFL career with a single tackle in his entire career. However, at just 230 pounds he pulled off 41 reps on the bench, and all of his other performances were above average. No other non-lineman in history has gotten more than 40 reps. The rest of the top few had or are currently having pretty good NFL careers.

Since the 40 time is the one that seems most discussed, here is the same analysis if you use only the 40 time to rank:

player	weight (lbs)	40 time (s)
Montez Sweat	260	4.41 (4.86)
Vernon Davis	254	4.38 (4.82)
Bryan Thomas	266	4.47 (4.89)
Dontari Poe	346	4.89 (5.35)
Dwight Freeney	266	4.48 (4.89)
Tank Johnson	304	4.69 (5.11)
Calvin Johnson	239	4.35 (4.74)
Dontay Moch	248	4.40 (4.79)
Matt Jones	242	4.37 (4.75)
Bruce Campbell	314	4.75 (5.16)
Taylor Mays	230	4.31 (4.69)
Terron Armstead	306	4.71 (5.12)
James Hanna	252	4.43 (4.81)
Martez Wilson	250	4.42 (4.80)
T.J. Duckett	254	4.45 (4.82)
Bruce Irvin	245	4.41 (4.77)
Rashan Gary	277	4.58 (4.95)
Connor Barwin	256	4.47 (4.83)
Nick Perry	271	4.55 (4.92)
Lane Johnson	303	4.72 (5.10)

It's interesting looking through both of these that the really legendary players aren't at the top. Many of them are good players, but Calvin Johnson and J.J Watt are the only ones near the top in either table that will definitely go down as all-time greats. Aaron Donald, Derrick Henry, etc. had above average combine performances but some that did clearly better went on to worse careers.

I was curious about that and decided to go in the other direction. What great players had bad combine performances? To do that, I took all all-pro players and matched with names from the combine, and the worst were Max Unger, Tyrann Mathieu, and Tarik Cohen. All under-performed estimates in every metric here. The worst performance from an all-time great here was Adrian Peterson. He was roughly average, but I would have guessed his 40 time was way better (4.68 s).

Split Violin Plots in plotly.js

2020-12-04T22:13:00.004-08:00

Split violins are a cool way to compare distributions, and plotly makes them simple.

There isn't much to explain here. I've embedded an example below showing how to use it. You just make a normal violin plot, but specify one trace as the negative side and another as the positive side, and plotly handles the rest.

See the Pen split violins in plotly js by Robert Hamner (@rhamner) on CodePen.

If you've ever wanted to plot multiple distributions side-by-side, this is an easy option.

How Long Until My Investments Start Making Money?

2020-11-17T22:14:00.023-08:00

Say you invest some fixed amount of money every year. How long does it take for the investments to grow faster than the amount you're putting into them?

Basic math problem

You invest $X per year in an account that yields R in gains. How long does it take for the gain in a year to be greater than $X?

Another way of asking this is 'when does R times the future value of investing X each year exceed X?'

The future value of a regular yearly investment where N = number of years, X = yearly investment, and R = growth rate of the investment is:

\[FV = \frac{X*((1+R)^N - 1)}{R}\]

What we're looking for is the number of years it takes for R times that to exceed X. That is, we want to solve:

\[\frac{R*X*((1+R)^N - 1)}{R} > X\]

Noticing that the R's cancel in numerator and denominator and dividing both sides by X you get:

\[((1+R)^N - 1) > 1\]

Adding 1 to both sides:

\[((1+R)^N) > 2\]

Simplifying:

\[log((1+R)^N) > log(2)\]

\[N*log((1+R)) > log(2)\]

\[N > \frac{log(2)}{log(1+R)}\]

This is kind of cool. You might not recognize it right away, but that says 'N is greater than the doubling time of the investment'. It's really cool that it works out that way. Because of some pretty good approximations that work out, that means that the investment growth takes over the new money moved in after roughly '72 divided by annual interest rate' years.

For a quick concrete example of what this means...say you invest $10,000 per year into an account yielding 6%. The time it takes for the 6% yield each year to exceed $10,000 is log(2)/log(1.06) which is ~12 years.

Simple plot

Here's a simple interactive plot showing the breakdown between money invested and money from gains for an annual $10,000 investment using the interest rate that you enter below:

Interest rate (%)

How Do American Betting Odds Convert to Percent Chance?

2020-11-01T22:30:00.012-08:00

If you've looked at betting odds, you've probably seen something like +140 and -175. What % chance does that imply for each participant?

Definition

First, what do those numbers mean? A -175 means 'you win $100 for each $175 that you bet' and a +140 means 'you win $140 for each $100 that you bet'.

Example

Now, consider a matchup that's 60% chance for A and 40% chance for B. What does that convert to?

Assuming no cost to bet, if there were 10 matches and you bet $100 on A each time, you'd put in $1000 and expect to get out $1000. Since A wins 60% of the time, you'd get 6 payouts and they would sum to $1000 (since you lose the bet on the 40% where A loses). Each bet would pay out $167, and subtracting off the initial $100 means a profit of $67. Thus, a $100 bet on A yields a profit of $67 when A wins which means that to get a profit of $100 you'd bet 100/.67, or $150. From the definition above, that means that a 60% chance of winning is a line of -150.

Doing the same with the 40% one, you'd get 4 payouts that sum to $1000, so $250 per payout and a profit of $150. You bet $100, and profit $150 on a win, so the line is +150.

Thus, a 60/40 matchup corresponds to a line of -150/+150. The general equation for the logic above is:

favorite: American line = - 100/[( 1/percent - 1)]
underdog: American line = 100*[( 1/percent - 1)]

Going the other direction:

favorite: percent = 1/[(100/-American) + 1]
underdog: percent = 1/[(American/100) + 1]

Real Life

It's not quite this easy. The person offering the bets (bookie) needs to make money. Imagine in the above that the person offering the bet wants to make $10 for every $100 bet. How does that change things?

Consider bets on A. You make 10 bets on A. A wins 60% of the time, so you should get $1000 back like before except that you pay $10 per bet so you get $900 back. 6 payout that give $900 means $150 per payout and subtracting initial investment means $50 profit. That means you'd bet 100/0.5, or $200 for each $100 profit which means the line is -200.

For B...4 payouts gets $900, so $225 payout per win which is $125 profit per win after subtracting initial investment. $125 profit on a $100 bet means +125 is the line.

How can you factor out this $10 cost (margin)?

You get a line of -200/+125 to start and want to see what the margin is on this. It's actually easy from what we did earlier. Simply convert these lines to the percentage versions, and add them together. Taking these specific numbers:

-200 => 66.67%
+125 => 44.44%
sum = 111.11%

That is, for every $100 that is bet, the bookie gets $11.11 (or in the earlier terms, for every $90 that is bet you pay an additional $10).

Finally...how do you get the implied chance of each option winning from odds that have the margin factored in like these? Simply divide each percentage by the sum.

favorite: 66.67% / 1.1111 = 60%
underdog: 44.44% / 1.1111 = 40%

And we recovered the original odds.

You can play with this in a spreadsheet here if you want.

Before (ms)	After (ms)
241	272
224	211
202	226
243	234
246	205
229	279
209	208
231	212
258	218
287	198
270	215
262	244
227	215
200	175
291	220
290	218
184	218
319	247
250	245
229	199

Before (ms)	After (ms)
241	272
224	211
202	226
243	234
246	205
229	279
209	208
231	212
258	218
287	198
270	215
262	244
227	215
200	175
291	220
290	218
184	218
319	247
250	245
229	199

Before (ms)	After (ms)
241	272
224	211
202	226
243	234
246	205
229	279
209	208
231	212
258	218
287	198
270	215
262	244
227	215
200	175
291	220
290	218
184	218
319	247
250	245
229	199