How much is a NHL Draft Pick Worth?

The NFL has a famous chart for quantitatively assigning a value to a draft pick (NFL Draft Pick Chart). Could something similar be made for hockey to assign values to NHL draft picks?

The Data

I scraped data NHL draft data from the 2000 draft to the 2010 draft from Elite Prospects. I collected player statistics from 2,741 players that were drafted during this time. To normalize the data, I only grabbed players that were drafted in the first 210 picks, as the 2010 draft only had 7 rounds and previous drafts had more rounds. I am also going to be using points per game to compare players, so I am going to limit my exploration to only forwards. This left me with 1,324 players to analyze.

Exploration

I first looked at the total points by draft class.
./images/Value_of_Draft_Pick/total_points_by_year.png

We see there is a little fluctuation in the total points by draft year. Now I want to see what percentage of players drafted played in the NHL.
./images/Value_of_Draft_Pick/percent_in_nhl_year.png

We see that the number of players that play in the NHL varies between 40% and around 60%. Now I am going to see how the Percentage of Players varies by the round they were drafted.
./images/Value_of_Draft_Pick/percent_by_round.png

We see that players drafted in the first round over 90% end up playing in the NHL, while players drafted in the 7th round only about half end up playing a single game in the NHL. Now that we looked at who makes the NHL, lets look at who is most successful in the NHL. To calculate success I am going to use points per game as my metric. First I will look at points per game by round drafted.
./images/Value_of_Draft_Pick/ppg_by_round_drafted.png
We see there is a steep decline in points per game after the first round. This decline continues all the way to round 7, however after round 4 the decline is very minimal. Now I am going to see how points per game changes based on pick in the first round.
./images/Value_of_Draft_Pick/ppg_by_pick.png

After the first pick the PPG dips down significantly. We see a little bit of a plateau between picks 3 to 5, after pick 5 it looks like white noise all the way to pick 30. This tells me that after pick 5 there is a lot of randomness in selecting players. If given the opportunity, trade up for a top 5 pick and if possible trade up for the number one overall pick.

Modeling Pick Importance

We see that earlier picks are more valuable, but just how much more valuable are they? To start I built regression model where each pick was treated as a categorical variable. I then plotted out the coefficient for each pick.
./images/Value_of_Draft_Pick/expected_ppg_by_time.png
We see that the expected points per game decreases rapidly after the 1st selection and is very random after the first 30 selections. This does not incorporate the risk of the selection, we previously saw that picks in the later round have fewer players that make the NHL. To incorporate the risk of the selection, I am going to divide each coefficient (commonly referred to as the T-Statistic).
\(T Statistic = \frac{coefficient}{standard error}\)
./images/Value_of_Draft_Pick/expected_normalized_by_risk.png

We now see that the first overall pick is significantly higher than every other pick, this is because there is significantly more risk with future picks. For a comparison, lets look at the number one overall picks during this time

PlayerPPG
Sidney Crosby1.28
Alex Ovechkin1.11
Patrick Kane1.05
Steven Stamkos1.04
Ilya Kovalchuk0.95
John Tavares0.94
Taylor Hall0.90
Rick Nash0.76

Most of these players ended up as legitimate superstars (as a reminder Rick Nash had injury problems). Now lets compare this to the 2 overall picks

PlayerPPG
Evgeni Malkin1.19
Dany Heatley0.91
Tyler Seguin0.86
Jason Spezza0.84
Eric Staal0.82
Bobby Ryan0.67
James Van Riemsdyk0.65
Jordan Staal0.56

We see that besides Malkin, the majority of of #1 draft picks having significantly more points per game than #2 overall draft picks.

Finally, to smooth out the randomness of the normalized expected points per game I am going to fit a polynomial regression to the data. I am using a polynomial regression due to the relationship not being linear.
./images/Value_of_Draft_Pick/all_picks1.png

We see that the line smooths out some of the randomness, however we still see a little randomness at the end. As I expand the search this will smooth out. Now lets only look at only the first round. This model gives me a \(R^2\) of 0.77.
./images/Value_of_Draft_Pick/first_round.png

We see after the first overall selection, the expected normalized points is consistent. Here are the ratings for the first 30 selections

PickValuePickValuePickValue
115.94115.28213.42
27.92125.04223.28
37.57134.82233.15
47.24144.62243.03
56.92154.42252.91
66.61164.23262.80
76.04174.05272.70
86.04183.88282.61
95.77193.71292.52
105.52203.56302.43

Key Takeaways

The number one overall pick is worth a lot and is probably being undervalued. If you have the means to trade for this pick, you should do it.