From the looks of it, there's a lot to talk about, so let's see what we can learn.
What is the QC Telling Us?
If I summarize this chart in one phrase, it would be the title of a Beach Boys hit single: Good Vibrations. From January to February, we see a lifting of the entire QC (except for a few spots), which is a good thing. From February to the present, we've seen the QC vibrate up and down, but roughly in the same range. Let's breakdown when parts of the range reached their peak:
- February Peaks: #1, #4, #11-13, #30-36, #38-43
- March Peaks: #2, #3, #7, #25, #27-29, #37
- Final Peaks: #5, #6, #8-10, #14-24, #26, #44-50
Now, let's look at how 2019 compares to the last two years, and this is a valuable comparison since the last two years provide an excellent QC dichotomy.
If the last chart showed signs of steepness (and likely bracket sanity), this one screams it even louder. Let's break it down into sections.
- The 2019 #1-10 spots are stronger than either 2017 or 2018. For 2018, that statement doesn't mean much, but for 2017, it speaks volumes. Judging that 2017 and 2018 look similar from the #1-6 spots, it is ironic that both years saw a 1-seed and two 2-seeds lose in the R32 (not to mention the 1-seed that lost in the R64). If 2019 is far stronger than either of those years, I think it is safe to predict that no more than two upsets will happen to 1- and 2-seeds collectively before the S16 (I do have my eye on two potential upsets, but I can't make those based on the QC/SC analysis).
- The #11-27 spots are literally stuck in the middle of 2017 and 2018. The sheer strength of this range in 2017 saw four upsets in the R64 (one for a 5-seed and three for 6-seeds) and one upset in the R32 (a 3-seed to an 11-seed). The sheer weakness of this range in 2018 saw four upsets in the R64 (two 4-seeds and two 6-seeds) and two upsets in the R32 (both were 3-seeds to 11-seeds). These totals look like safe targets, but we need to review the SC before making it official: Four upsets in the R64 and either one or two upsets in the R32 for for 3- thru 6-seed.
- The 2019 #28-45 spots are stronger than either 2017 or 2018. This range of teams includes one 7-seed, all four 8-seeds, two 9-seeds, two 10-seeds, two 11-seeds, a 12-seed, and six NIT-eligible teams (lol, just seeing if you are paying attention). This is interesting because the two potential Cinderellas I mentioned above are in this group, but I've given too much information for anyone to decipher the identity of these two specific teams.
- The 2019 #46-50 spots are mostly weaker than 2017 or 2018, but as I mentioned above, three of these teams are over-seeds and the other two are not in the NCAA tourney. I'm not too worried about these five teams.
The 2019 Seed Curve
Wow, what a thing of beauty! Okay, platitudes aside, let's get down to business. There's only two things I'm going to point out from this chart.
- The same steepness that defines the QC also can be seen in the SC. There are slight curvatures from the 2- to 5-seeds and from the 7- to 10-seeds, but only the latter seems to have significant bowing. Bowing in the SC helps identify areas of over-seeding and under-seeding (if teams were seeded directly by advanced metrics, the SC would have linear properties instead of quadratic). There is also a spike at the 12-seed, which either indicates severe weakness in the 11-seed group or under-seeded (and potentially Cinderella) 12-seeds.
- The second point I want to emphasize is lack of dispersion within the seed-group. I've never mentioned this before, but I have kept an eye on it over the last few years. When the four members (sometimes more for seed groups with play-in members) show little dispersion from the seed-group average, good things happen for the seed-group. In 2017, the 3-seed group was tightly centered around the mean, and one 3-seed made the F4 and two more met seed expectations by reaching the S16. Also in 2017, the 9-seeds were tight around the mean, but they were also the trough of a major dip, so only one 9-seed won in the R64. In 2018 (Link), 5-seeds and 9-seeds were centered tightly around the mean: All 5-seeds defeated their 12-seed counterparts with three of the four reaching the S16, and three 9-seeds defeated their 8-seed counterparts with two of them reaching the E8. Also in 2018, 8-seeds were tightly centered around their mean, but unfortunately both 8-seeds and 9-seeds cannot advance simultaneously. In 2019, we see the 6-seeds, 8-seeds, and 9-seeds centered tightly around their mean. The only question that remains is what the result will be: A deep run by one of the members of the seed-group or all four members of the seed-group advancing past their counterpart. I present both options to you because I'm not certain which will happen, nor do I have any tools to determine which will result. I am fairly confident that good things should happen to the 6-seed group and either the 8- or the 9-seed group.
As troubling as last year's QC-SC overlay, this year looks even tighter than last year. The Top-28 teams in both the QC and the SC share 27 teams in common, meaning the 1- thru 7-seeds are approximately close to their true seeding (this can also be seen on the OS/US model). In fact, all 1- thru 7-seeds are within two seed-lines of its true seed (determined by advanced metrics). The assumption made last year is the tightness of the QC-SC overlay means upsets are less likely as seeds are indicative of (or approximate to) true team quality. The locations along the curve where the SC rises above the QC are high-probability targets for out-performing seed expectations. Last year, 2-seeds, 3-seeds and 5-seeds were high-probability targets for out-performance, and only the 2-seed group failed to exceed expectations (Expected 12 wins vs Actual 9 wins). According to the 2019 SC, the 2-seed, the 5-seed and the 7-seed group look to be high-probability targets for out-performing seed expectations.
Concluding Thoughts
Typically, I provide my full predictions for round-by-round upsets and the Aggregation Model in this section of the article. Instead, I'm going to save that for a final article to be published late Wednesday night or early Thursday morning. I want to get the results of a few more models before I make a final call. If you don't want to wait for those numbers or want to make your own attempt at a prediction, I do like the 2015 SC and the 2003 SC models (Link) to be close approximations for the 2019 tourney. As always, thanks for reading my work, and I am still committed to my BCT schedule.
Big fan of your work. thank you for what you do.
ReplyDeleteWhen you say I do have my eye on two potential upsets, but I can't make those based on the QC/SC analysis.
-what do you use to look for potential upsets? offensive rebounding? turnover rates forced per possession? or most of the stats on here https://www.teamrankings.com/ncaa-basketball/stat/offensive-rebounding-pct?
For me, it's more about process. I don't really place a lot of emphasis on one tool or one stat category.
Delete1. I look at the big picture first: chalk vs crazy, QC/SC analysis and Agg/Upset Model estimation (curve-fitting)
2. Find safe favorites for deep runs (Note: This has been around 50% accurate for the last three years, which I'm not happy about)
3. The remaining non-favorites get evaluated via numerous systems (Pete's rules, OS/US and Vegas for R64, other models never mentioned on my blog).
To see how my process works with an example: Many identified UVA, KU and UNC in 2018. I went with UVA losing to UK (1 to 5), KU losing to HALL (1 to 8), and UNC losing to TXAM (2 to 7). You know how each of those played out. Likewise in 2018, I favored 2-seeds, 3-seeds, and 5-seeds, and they went 12-0 in R64 and 7-5 in R32. My process is better at avoiding risks than going for the jugular, if that makes sense.