On the other hand, sometimes it does some pretty boneheaded things that don’t seem to make much sense.
Let’s take a look at some examples… from one side of the spectrum to the other.
(Important Note: All the boneheaded behavior I talk about in this article is found in SQL2005. SQL2008, on the other hand, does not exhibit any of the boneheaded behavior.)
Take the following query, which we’ll call Query #1:
/* Query #1 */What goes through the Optimizer’s head when it constructs this query?
select distinct CustomerID,ProductID
from AdventureWorks.Sales.SalesOrderDetail sod
join AdventureWorks.Sales.SalesOrderHeader soh on sod.SalesOrderID=soh.SalesOrderID
where 1=2
and power(2,sqrt(0))<>1
and char(-1) is not null
and sin(pi()/2)<>1
and soh.SalesOrderID is null
data:image/s3,"s3://crabby-images/0abfd/0abfdb30ddaf5ab424a0d495349859e9b2f6b6ff" alt="SQL Server"
This is the query plan that it puts together for the above query. It just plops out an empty set:
data:image/s3,"s3://crabby-images/4fbfc/4fbfc46b65716f0656f45d8a541856251220ba26" alt="Query #1 Plan"
Excellent.
The following query (Query #2) is exactly like Query #1, except all the predicates are negated so that they will all evaluate to True:
/* Query #2 */This is the plan that is generated:
select distinct CustomerID,ProductID
from AdventureWorks.Sales.SalesOrderDetail sod
join AdventureWorks.Sales.SalesOrderHeader soh on sod.SalesOrderID=soh.SalesOrderID
where 1<>2
and power(2,sqrt(0))=1
and char(-1) is null
and sin(pi()/2)=1
and soh.SalesOrderID is not null
data:image/s3,"s3://crabby-images/6cff4/6cff4613398245c4e99fc15bc5a2c106a11a29e6" alt="Query #2 Plan"
The Optimizer’s thinking is:
data:image/s3,"s3://crabby-images/0abfd/0abfdb30ddaf5ab424a0d495349859e9b2f6b6ff" alt="SQL Server"
So it knows the WHERE clause is not needed, and it decides to SCAN the smaller-sized nonclustered indexes to get the data it needs. Brilliant! The Optimizer should be applauded.
What do our judges think of the plans for Query #1 and Query #2?
data:image/s3,"s3://crabby-images/429a4/429a461abb583e99c6a82ab1e3d948c79d3948b7" alt="Two Einsteins"
They awarded the plans Two Einsteins! Very impressive.
Now take a look at this query (Query #3). It’s exactly the same as Query #2, only with a different WHERE clause. Again, though, all the predicates evaluate to True… at least you and I can easily see that:
/* Query #3 */But the Optimizer doesn’t see it that way. Here’s the query plan for that query:
select distinct CustomerID,ProductID
from AdventureWorks.Sales.SalesOrderDetail sod
join AdventureWorks.Sales.SalesOrderHeader soh on sod.SalesOrderID=soh.SalesOrderID
where SalesOrderDetailID=SalesOrderDetailID
and ProductID=ProductID
and SpecialOfferID=SpecialOfferID
and OrderQty=OrderQty
and UnitPrice=UnitPrice
data:image/s3,"s3://crabby-images/31fb8/31fb802b6b9200d4d057881d8cbc08cc4a258123" alt="Query #3 Plan"
Note the arrows between the operators are a lot thinner than in the previous query, and note that it is doing a SCAN of the SalesOrderDetail clustered index and a SEEK into the SalesOrderHeader table. Here is apparently what’s going on inside the Optimizer’s head as it constructs this plan:
data:image/s3,"s3://crabby-images/0abfd/0abfdb30ddaf5ab424a0d495349859e9b2f6b6ff" alt="SQL Server"
You can see the estimate of 1.21317 rows here in the properties of the Clustered Index Scan:
data:image/s3,"s3://crabby-images/94334/943346b0709701ab0dc088b7a9bcf49688f26322" alt="Estimated Rows in Query #3"
This doesn’t make much sense. Why does the Optimizer think that it’s comparing columns to some unknown value? The only time something isn’t equal to itself is when it has a value of NULL, but all 5 columns referred to in the WHERE clause are non-nullable columns. It’s surprising that the Optimizer can’t see that each predicate is True.
And the plan that it did come up with performs terribly. Look at a comparison in performance between Query #2 and Query #3:
/* CPU #Reads
-------------------------
Query #2: 210ms 299
Query #3: 695ms 365,458
*/
Yikes! Look at the number of Reads! Think about it… in executing Query #3, for each one of the 121,317 rows in SalesOrderDetail, it does a SEEK into SalesOrderHeader. That’s really inefficient.I don’t think our panel of judges will be thrilled with this, do you? Let’s see what they thought of the Query #3 plan:
data:image/s3,"s3://crabby-images/916b7/916b7b86c469ebee1fe4d18ea7bc5872e9378ae5" alt="Four Curlys"
Four Curlys! That’s gotta hurt. But you have to admit... It was a boneheaded plan.
(Important Note: The above phenomenon occurs in SQL2005. It does not behave the same in SQL2008. In SQL2008, Query #2 and Query #3 have identical query plans.)
Here’s another example in inconsistent approaches by the Optimizer. Look at the following query (Query #4), which uses window functions in OVER clauses in a CTE:
/* Query #4 */What does the Optimizer think when it constructs the plan for that query?
with cte as
(
select SalesOrderID,ProductID
,RNum=row_number() over (partition by ProductID order by SalesOrderID)
,Rnk=rank() over (partition by SpecialOfferID order by CarrierTrackingNumber)
,DRnk=dense_rank() over (partition by UnitPrice order by RowGUID)
from AdventureWorks.Sales.SalesOrderDetail
)
select SalesOrderID,ProductID
from cte
data:image/s3,"s3://crabby-images/0abfd/0abfdb30ddaf5ab424a0d495349859e9b2f6b6ff" alt="SQL Server"
And that’s what it does, as we can see from the following plan:
data:image/s3,"s3://crabby-images/8b89a/8b89ad3b0ec7fea514afc52b4d2c7de9833f4484" alt="Query #4 Plan"
That’s a very sensible plan. What do the judges say?
data:image/s3,"s3://crabby-images/54c71/54c71f3bade8b582a78c4e553291f20f03eab443" alt="Three Doc Browns"
Three Emmett Browns, huh?. Sure, he’s a bit quirky, and he’s not as impressive as Einstein or Plato or Galileo or Leonardo or Mozart, but hey, he invented time travel… that’s pretty cool.
Now look at the following query (Query #5), which is exactly like Query #4, except that it uses aggregates in OVER clauses in the CTE rather than window functions:
/* Query #5 */Wouldn’t you think that the Optimizer would come up with the exact same plan as Query #4? But nooooo… here’s what it’s thinking:
with cte as
(
select SalesOrderID,ProductID
,Cnt=count(*) over (partition by ProductID)
,TotQty=sum(OrderQty) over (partition by SpecialOfferID)
,AvgLine=avg(LineTotal) over (partition by UnitPrice)
from AdventureWorks.Sales.SalesOrderDetail
)
select SalesOrderID,ProductID
from cte
data:image/s3,"s3://crabby-images/0abfd/0abfdb30ddaf5ab424a0d495349859e9b2f6b6ff" alt="SQL Server"
That kind of sounds like my dog when I pull out the bag of dog food every single night. Sure enough, here’s the plan:
data:image/s3,"s3://crabby-images/110e2/110e20d447ab432a9e872b644eb1b00d8a3c4291" alt="Query #5 Plan"
Look at all those operators! All those aggregations and sorts and everything, and it’s all a waste of time because none of the calculations are in the final result set.
Look at the comparison in performance between Query #4 and Query #5:
/* CPU #Reads
-------------------------
Query #4: 39ms 236
Query #5: 3059ms 811,834
*/
Incredible. Query #5 had over 3400 times the number of reads as Query #4 and almost 100 times the CPU time.Judges? Your verdict?
data:image/s3,"s3://crabby-images/7e66e/7e66e903638d8369ba0c48554de3ec56128d353a" alt="Nine SpongeBobs"
Ouch!! Nine SpongeBobs! The judges definitely thought that was an atrocious, putrid, abysmal, boneheaded plan.
(Important Note: The above phenomenon occurs in SQL2005. It does not behave the same in SQL2008. In SQL2008, Query #4 and Query #5 have identical query plans.)
Of course, none of the above queries are necessarily ones you’d actually put together in the real world (though I did stumble upon the behavior of Query #4 and #5 while answering a question in the MSDN T-SQL Forum). And I don’t mean to put down the Optimizer. I guess “bonehead” is kind of a strong word. Perhaps the Optimizer is sometimes just a little bit naïve.
I guess the moral of the story, as always, is to look carefully at your query plans. In the words of another person bordering on genius and naïveté: “Query Plans are like a box of chocolates… You never know what you’re gonna get.”