<?xml version="1.0" encoding="UTF-8"?>
<blog-post>
  <updated-at type="datetime">2009-10-08T11:58:40-04:00</updated-at>
  <title>Behind the Scoreboard: How can you tell if a playoff team will win?</title>
  <published-at type="datetime">2009-10-05T12:53:56-04:00</published-at>
  <comments-count type="integer">9</comments-count>
  <created-at type="datetime">2009-10-05T12:53:56-04:00</created-at>
  <comments-page>
    <current-page type="integer">1</current-page>
    <total-pages type="integer">1</total-pages>
    <per-page type="integer">10</per-page>
    <comments type="array">
      <comment>
        <quotable>
        </quotable>
        <created-at>2009-10-06T17:58:06-04:00</created-at>
        <user>
          <image nil="true"></image>
          <comments-count type="integer">6</comments-count>
          <state>MO</state>
          <display-name>MrKid72</display-name>
          <city>Eureka                      </city>
          <id type="integer">576346</id>
        </user>
        <quoted-text nil="true"></quoted-text>
        <commentable-sequence type="integer">9</commentable-sequence>
        <body>I think that looking at the whole season to try to predict the post-season is a big problem because:

1.  Teams switch from a 5-man to a 4 (or sometimes 3) man rotation in the postseason;

2.  There are personnel differences between the regular season and the post season due to injuries, trades, etc.

A 25-man roster becomes effectively about an 18-man roster in the post-season.  Somebody needs to figure out how to compare the top 18 (9 position players, 9 pitchers) on each team, and scrape off the stats from the bottom of the barrel players who won't be appearing in the game unless they are more than 8 runs ahead or behind.</body>
        <id type="integer">7144087</id>
      </comment>
      <comment>
        <quotable>
        </quotable>
        <created-at>2009-10-06T12:59:39-04:00</created-at>
        <user>
          <image nil="true"></image>
          <comments-count type="integer">7</comments-count>
          <state>OH</state>
          <display-name>Big Dawg 69</display-name>
          <city>Greenville                  </city>
          <id type="integer">63737</id>
        </user>
        <quoted-text nil="true"></quoted-text>
        <commentable-sequence type="integer">8</commentable-sequence>
        <body>Dodgers win and Torre moons the Steinbrenners from the mound--or not.</body>
        <id type="integer">7137880</id>
      </comment>
      <comment>
        <quotable>
        </quotable>
        <created-at>2009-10-06T12:22:32-04:00</created-at>
        <user>
          <image nil="true"></image>
          <comments-count type="integer">6</comments-count>
          <state>NJ</state>
          <display-name>NL East Fan</display-name>
          <city>Trenton                     </city>
          <id type="integer">683695</id>
        </user>
        <quoted-text nil="true"></quoted-text>
        <commentable-sequence type="integer">7</commentable-sequence>
        <body>Hey Eyepatch, you need to get that crystal ball calabrated. If it was working right, it would show the Yankees losing in five to the Phillies.</body>
        <id type="integer">7137022</id>
      </comment>
      <comment>
        <quotable>
        </quotable>
        <created-at>2009-10-06T05:36:19-04:00</created-at>
        <user>
          <image>http://img.fannation.com/upload/user_profile/image/634/226/thumb/616px-DarthNihl.jpg</image>
          <comments-count type="integer">769</comments-count>
          <state>UT</state>
          <display-name>THEeyepatch</display-name>
          <city>Salt Lake City              </city>
          <id type="integer">634866</id>
        </user>
        <quoted-text nil="true"></quoted-text>
        <commentable-sequence type="integer">6</commentable-sequence>
        <body>You know what..... I will just use my crystal ball to predict..... hold on...... seeing something.............. wait a second.............. YANKEES WIN WORLD SERIES!!!!</body>
        <id type="integer">7130694</id>
      </comment>
      <comment>
        <quotable>
        </quotable>
        <created-at>2009-10-06T02:06:38-04:00</created-at>
        <user>
          <image nil="true"></image>
          <comments-count type="integer">1</comments-count>
          <state>MO</state>
          <display-name>meyds90</display-name>
          <city>Florissant                  </city>
          <id type="integer">883528</id>
        </user>
        <quoted-text nil="true"></quoted-text>
        <commentable-sequence type="integer">5</commentable-sequence>
        <body>How did you account for the fact that your two independent variables, a team's record and home field advantage, are correlated? The team with the higher record in the playoffs is by rule given home field advantage, is it not?</body>
        <id type="integer">7130679</id>
      </comment>
      <comment>
        <quotable>
        </quotable>
        <created-at>2009-10-06T01:27:21-04:00</created-at>
        <user>
          <image nil="true"></image>
          <comments-count type="integer">12</comments-count>
          <state>mi</state>
          <display-name>prettyrickymyers</display-name>
          <city>wyoming</city>
          <id type="integer">512459</id>
        </user>
        <quoted-text nil="true"></quoted-text>
        <commentable-sequence type="integer">4</commentable-sequence>
        <body>way to make something simple sound really complex</body>
        <id type="integer">7130573</id>
      </comment>
      <comment>
        <quotable>
        </quotable>
        <created-at>2009-10-05T23:55:50-04:00</created-at>
        <user>
          <image nil="true"></image>
          <comments-count type="integer">87</comments-count>
          <state>CO</state>
          <display-name>RCR</display-name>
          <city>Aurora                      </city>
          <id type="integer">532319</id>
        </user>
        <quoted-text nil="true"></quoted-text>
        <commentable-sequence type="integer">3</commentable-sequence>
        <body>Sa-weet article! Thanks for the insight. I am looking foward to the refinements in the playoff model.</body>
        <id type="integer">7130365</id>
      </comment>
      <comment>
        <quotable>
        </quotable>
        <created-at>2009-10-05T21:14:03-04:00</created-at>
        <user>
          <image nil="true"></image>
          <comments-count type="integer">2</comments-count>
          <state>IL</state>
          <display-name>clarkandaddison</display-name>
          <city>Chicago                     </city>
          <id type="integer">883428</id>
        </user>
        <quoted-text nil="true"></quoted-text>
        <commentable-sequence type="integer">2</commentable-sequence>
        <body>great job.  looking forward to your future analysis.</body>
        <id type="integer">7128994</id>
      </comment>
      <comment>
        <quotable>
        </quotable>
        <created-at>2009-10-05T20:24:55-04:00</created-at>
        <user>
          <image nil="true"></image>
          <comments-count type="integer">5</comments-count>
          <state>XX</state>
          <display-name>submariner</display-name>
          <city>No City</city>
          <id type="integer">835813</id>
        </user>
        <quoted-text nil="true"></quoted-text>
        <commentable-sequence type="integer">1</commentable-sequence>
        <body>doo doo del toro</body>
        <id type="integer">7128725</id>
      </comment>
    </comments>
    <total-entries type="integer">9</total-entries>
  </comments-page>
  <body>&lt;p&gt;&lt;em&gt;&lt;strong&gt;Sky Andrecheck&lt;/strong&gt; writes for the &lt;a href=&quot;http://www.hardballtimes.com/&quot;&gt;Hardball Times&lt;/a&gt;, an awesome site that you should be reading if you aren't already. He'll be a regular contributor to this blog throughout the postseason&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;October is here, and while 162 games have given us an idea of who the top teams are in the regular season, the playoffs are something else, widely and accurately seen as a crapshoot in which even the best teams need a lot of luck to win.  (Since the beginning of the wild-card era, the team with the best record has gone on to win the World Series just twice in 14 years -- Red Sox in '07 and Yanks in '98.)  So should we throw the regular-season stats out the window when it comes to the postseason?  Or is there something in teams' regular-season play that can help us predict postseason success after all?&lt;/p&gt;
&lt;p&gt;You've heard the cliches: Good pitching beats good hitting, &quot;small ball&quot; teams usually do well, teams that live and die by the home run usually die by it in the playoffs and home-field advantage takes on vastly added importance in October.  Do any of these theories have merit, or are they baseball's equivalents to old wives' tales?&lt;/p&gt;
&lt;p&gt;I examined all postseason games stretching back to 1969 and looked at a variety of team statistics: The regular-season winning percentages of the two teams, the teams' Pythagorean records, the strength of each team's pitching staff (including runs allowed, strikeouts and walks), and the strength of each team's offense (including runs scored, home runs, walks, strikeouts and stolen bases).&lt;/p&gt;
&lt;p&gt;Putting these statistics into a model, we can determine which factors are important in winning postseason games.  (For those statistically minded, it's a &lt;a href=&quot;http://en.wikipedia.org/wiki/Logistic_regression&quot;&gt;logistic regression model&lt;/a&gt;, designed to model binary outcomes -- in this case whether the team in question won or lost.)  This allows us to predict outcomes based on a variety of different characteristics of each team.&lt;/p&gt;
&lt;p&gt;As it turns out, very few of the commonly held theories are true.  Think that teams who steal a lot of bases have an advantage in the playoffs?  When controlling for other factors, the team with the speed advantage did not perform any better in the playoffs.  How about the &quot;good pitching beats good hitting&quot; mantra?  Again, when taking into account the records of the two teams, teams that thrived via pitching and defense fared no better than the slugging teams.  Think that power staffs that dominate with strikeouts are ripe for postseason success?  The team with the power pitching advantage gained no special edge.  And, when accounting for other factors, the team that won via the long ball fared no better or worse than its punch-and-judy counterparts.  In fact, none of the team characteristics had a &lt;a href=&quot;http://en.wikipedia.org/wiki/P-value&quot; target=&quot;_blank&quot;&gt;statistical p-value&lt;/a&gt; that was close to statistically significant.&lt;/p&gt;
&lt;p&gt;How about a team's &lt;a href=&quot;http://en.wikipedia.org/wiki/Pythagorean_expectation&quot;&gt;Pythagorean record&lt;/a&gt;? Supposedly a more consistent indicator of team performance than vanilla run differential, is it actually an indicator of postseason success?  Regular records and Pythagorean records are highly correlated, so it's difficult to separate the two, but the model clearly preferred looking at a team's regular record.  The model suggests giving a team's regular record about 80 percent of the weight, and giving only 20 percent of the weight to the Pythagorean record.  However, the Pythagorean record is not statistically significant in the model, meaning that you'll probably have nearly as good of prognosticating ability simply using a team's regular-season record alone.&lt;/p&gt;
&lt;p&gt;So what &lt;em&gt;does &lt;/em&gt;make a difference?  Sadly (though unsurprisingly), there's no magic bullet.  Simply put, teams that win in the regular season have a better shot at winning in the postseason.  After debunking the myths, the model says that just two main factors matter: The winning percentage of the two teams and home-field advantage.  When controlling for the records of the two teams, home field advantage boosts a team's probability of victory by about 6.2 percent, which is slightly higher than the regular-season home field advantage, which boosts the home team by about 4 percent.  Therefore, in a playoff game between two evenly matched teams, the home team will have about a 56 percent chance to win.&lt;/p&gt;
&lt;p&gt;Additionally, each game ahead in the standings boosts a team's chances of winning by about 0.7 percent.  Therefore, in a playoff game between a 100-win team and a 90-win team, we would expect the 100-win team to win about 57 percent of the time.&lt;/p&gt;
&lt;p&gt;So, what's the quick and dirty way of determining the probability of winning a postseason game?  You can use this simplified formula:&lt;em&gt;&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Chance of Team A Winning = 44% + (Team A Wins - Team B Wins)*0.7% (+ 12% if Team A is home)&lt;/em&gt;.  A graph below shows the teams' probability of winning based on the two factors.&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://img.fannation.com/upload/si_blog_post_images/43841/andrecheckprob2.gif&quot; alt=&quot;Andrecheckprob2&quot; width=&quot;500&quot; height=&quot;315&quot; /&gt;&lt;/p&gt;
&lt;p&gt;Perhaps more interesting is a graph showing the probability of winning an entire five-game series, not just one game.  In this case, the home-field advantage is not quite as important, but the differences between the two teams are magnified.&lt;/p&gt;
&lt;p&gt;&lt;img src=&quot;http://img.fannation.com/upload/si_blog_post_images/43861/andrecheckprob1.gif&quot; alt=&quot;Andrecheckprob1&quot; width=&quot;500&quot; height=&quot;276&quot; /&gt;&lt;/p&gt;
&lt;p&gt;So, what does the blind model think of this year's postseason?  The probabilities are as follows:&lt;/p&gt;
&lt;p&gt;NYY 67 percent chance of victory vs. DET/MIN&lt;br /&gt;LAA 55 percent chance of victory vs. BOS&lt;br /&gt;LAD 58 percent chance of victory vs. STL&lt;br /&gt;PHI 54 percent chance of victory vs. COL&lt;/p&gt;
&lt;p&gt;Of course, this model doesn't explain it all.  As we'll be talking about throughout the postseason, many other factors can sway these probabilities.  Personnel changes such as midseason pickups and injuries can sway things, as can a top-heavy starting rotation and bullpen.  And then, of course, there is the question of how particular teams match up with each other.  Still, the blind model can be a good starting point in predicting the fate of each postseason series.  We'll get more refined in the days to come.&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Sky Andrecheck writes for &lt;a href=&quot;http://www.hardballtimes.com/&quot;&gt;HardballTimes.com&lt;/a&gt; and is an author in the &lt;/em&gt;&lt;a href=&quot;http://www.actasports.com/detail.html?&amp;amp;id=080&quot;&gt;Hardball Times 2010 Baseball Annual&lt;/a&gt;&lt;em&gt;.  He also writes regularly for &lt;a href=&quot;http://baseballanalysts.com/&quot;&gt;BaseballAnalysts.com&lt;/a&gt;.  He is a statistician by trade.&lt;/em&gt;&lt;/p&gt;</body>
  <blogger>
    <image nil="true"></image>
    <comments-count type="integer">1102</comments-count>
    <state>NY</state>
    <display-name>The Moderator</display-name>
    <city>New York</city>
    <id type="integer">7</id>
  </blogger>
  <id type="integer">79803</id>
  <intro>&lt;p&gt;&lt;em&gt;&lt;strong&gt;Sky Andrecheck&lt;/strong&gt; writes for the &lt;a href=&quot;http://www.hardballtimes.com/&quot;&gt;Hardball Times&lt;/a&gt;, an awesome site that you should be reading if you aren't already. He'll be a regular contributor to this blog throughout the postseason&lt;/em&gt;.&lt;/p&gt;
&lt;p&gt;October is here, and while 162 games have given us an idea of who the top teams are in the regular season, the playoffs are something else, widely and accurately seen as a crapshoot in which even the best teams need a lot of luck to win.  (Since the beginning of the wild-card era, the team with the best record has gone on to win the World Series just twice in 14 years -- Red Sox in '07 and Yanks in '98.)  So should we throw the regular-season stats out the window when it comes to the postseason?  Or is there something in teams' regular-season play that can help us predict postseason success after all?&lt;/p&gt;
&lt;p&gt;You've heard the cliches: Good pitching beats good hitting, &quot;small ball&quot; teams usually do well, teams that live and die by the home run usually die by it in the playoffs and home-field advantage takes on vastly added importance in October.  Do any of these theories have merit, or are they baseball's equivalents to old wives' tales?&lt;/p&gt;
&lt;p&gt;I examined all postseason games stretching back to 1969 and looked at a variety of team statistics: The regular-season winning percentages of the two teams, the teams' Pythagorean records, the strength of each team's pitching staff (including runs allowed, strikeouts and walks), and the strength of each team's offense (including runs scored, home runs, walks, strikeouts and stolen bases).&lt;/p&gt;
&lt;p&gt;Putting these statistics into a model, we can determine which factors are important in winning postseason games.  (For those statistically minded, it's a &lt;a href=&quot;http://en.wikipedia.org/wiki/Logistic_regression&quot;&gt;logistic regression model&lt;/a&gt;, designed to model binary outcomes -- in this case whether the team in question won or lost.)  This allows us to predict outcomes based on a variety of different characteristics of each team.&lt;/p&gt;
&lt;p&gt;As it turns out, very few of the commonly held theories are true.  Think that teams who steal a lot of bases have an advantage in the playoffs?  When controlling for other factors, the team with the speed advantage did not perform any better in the playoffs.  How about the &quot;good pitching beats good hitting&quot; mantra?  Again, when taking into account the records of the two teams, teams that thrived via pitching and defense fared no better than the slugging teams.  Think that power staffs that dominate with strikeouts are ripe for postseason success?  The team with the power pitching advantage gained no special edge.  And, when accounting for other factors, the team that won via the long ball fared no better or worse than its punch-and-judy counterparts.  In fact, none of the team characteristics had a &lt;a href=&quot;http://en.wikipedia.org/wiki/P-value&quot; target=&quot;_blank&quot;&gt;statistical p-value&lt;/a&gt; that was close to statistically significant.&lt;/p&gt;
&lt;p&gt;How about a team's &lt;a href=&quot;http://en.wikipedia.org/wiki/Pythagorean_expectation&quot;&gt;Pythagorean record&lt;/a&gt;? Supposedly a more consistent indicator of team performance than vanilla run differential, is it actually an indicator of postseason success?  Regular records and Pythagorean records are highly correlated, so it's difficult to separate the two, but the model clearly preferred looking at a team's regular record.  The model suggests giving a team's regular record about 80 percent of the weight, and giving only 20 percent of the weight to the Pythagorean record.  However, the Pythagorean record is not statistically significant in the model, meaning that you'll probably have nearly as good of prognosticating ability simply using a team's regular-season record alone.&lt;/p&gt;
&lt;p&gt;So what &lt;em&gt;does &lt;/em&gt;make a difference?&lt;/p&gt;</intro>
</blog-post>
