SQL join queries: the difference between a direct JOIN and subqueries

Published: 2020-07-21 17:22:43  Source: Web  Views: 7566  Author: layveen  Category: MySQL Database

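A colleague on the operations team recently asked for a report on the system's users and their orders. We took the obvious route and wrote a statistics query that joined the user table users directly to the trip tables, with a GROUP BY to count the trips, but the query ran astonishingly slowly.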

explain select users.`mobile_num`, concat(users.`lastName`, users.`firstName`) as userName, users.`company`,
  (case `users`.`idPhotoCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `idPhotoCheckStatus`,
  (case `users`.`driverLicenseCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `driverLicenseCheckStatus`,
  (case `users`.`companyCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `companyCheckStatus`,
  (case `users`.`unionCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `unionCheckStatus`,
  count(passenger_trip.id) as ptrip_num
from users
left join passenger_trip on passenger_trip.userId = users.id and passenger_trip.status != 'cancel'
left join driver_trip on driver_trip.`userId` = users.`id` and driver_trip.`status` != 'cancel'
where company != 'our company name' and company != 'our company nickname'
group by users.id

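My first reaction was that the database had hung: the user table holds roughly 100,000 rows and the trip table also holds roughly 100,000, so it should not be this slow! I looked at the plan with explain and checked the indexes on the join columns, and found nothing unusual. This is the most ordinary kind of relational query, and a join is the natural way to write it.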
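The checks mentioned above can be sketched roughly as follows. This assumes MySQL and the table and column names from the query above; the user id 1 is only an illustrative value, and whether a suitable index exists depends on the actual schema.

```sql
-- List the indexes defined on each trip table; look for one whose
-- leading column is userId, the column used in the join condition.
show index from passenger_trip;
show index from driver_trip;

-- Ask the optimizer how it would resolve the per-user lookup.
-- In the output, type = ref with a non-NULL key means an index is
-- used; type = ALL means a full table scan.
explain
select count(id)
from passenger_trip
where userId = 1            -- illustrative user id (assumption)
  and status != 'cancel';
```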

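Then it occurred to me: 100,000 × 100,000. After a Cartesian product, isn't that filtering on the order of ten billion rows?! So I tried writing the query a different way.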
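A small illustration of how the joined row set grows. Strictly speaking, the join condition on userId keeps rows of different users from being paired, but with two left joins on the same user, every passenger trip of that user is still paired with every driver trip of that user. The demo_ tables and their contents below are hypothetical, made up for this sketch; they are not the author's schema.

```sql
-- Hypothetical miniature tables, used only for this illustration.
create table demo_users (id int primary key);
create table demo_passenger_trip (id int primary key, userId int);
create table demo_driver_trip (id int primary key, userId int);

insert into demo_users values (1);
insert into demo_passenger_trip values (1, 1), (2, 1), (3, 1);  -- 3 passenger trips
insert into demo_driver_trip values (1, 1), (2, 1);             -- 2 driver trips

-- Joining both trip tables pairs every passenger trip with every
-- driver trip of the same user: 3 x 2 = 6 joined rows for user 1.
select u.id,
       count(*)             as joined_rows,         -- 6
       count(p.id)          as ptrip_num,           -- 6, not 3
       count(distinct p.id) as ptrip_num_distinct   -- 3
from demo_users u
left join demo_passenger_trip p on p.userId = u.id
left join demo_driver_trip d on d.userId = u.id
group by u.id;
```

Besides the extra rows to process, a plain count over such a join over-counts, which is another reason to aggregate each trip table separately.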

explain select users.`mobile_num`, concat(users.`lastName`, users.`firstName`) as userName, users.`company`,
  (case `users`.`idPhotoCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `idPhotoCheckStatus`,
  (case `users`.`driverLicenseCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `driverLicenseCheckStatus`,
  (case `users`.`companyCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `companyCheckStatus`,
  (case `users`.`unionCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `unionCheckStatus`,
  (select count(passenger_trip.id) from passenger_trip where passenger_trip.userId = users.id and passenger_trip.status != 'cancel') as ptrip_num,
  (select count(driver_trip.id) from driver_trip where driver_trip.userId = users.id and driver_trip.status != 'cancel') as dtrip_num
from users
where company != 'our company name' and company != 'our company nickname'

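To my surprise this was many times faster than the direct join: the execution time went from unknown to returning within 10 seconds. Looking at the execution plan: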

[Screenshot: execution plan (explain output) of the subquery version; image not preserved]

I then adjusted the SQL further and tried again:

explain select users.`mobile_num`, concat(users.`lastName`, users.`firstName`) as userName, users.`company`,
  (case `users`.`idPhotoCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `idPhotoCheckStatus`,
  (case `users`.`driverLicenseCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `driverLicenseCheckStatus`,
  (case `users`.`companyCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `companyCheckStatus`,
  (case `users`.`unionCheckStatus` when '2' then 'verified' when '3' then 'rejected' else 'pending' end) as `unionCheckStatus`,
  ptrip_num, dtrip_num
from users
left join
  (select count(passenger_trip.id) as ptrip_num, passenger_trip.`userId` from passenger_trip where passenger_trip.status != 'cancel' group by passenger_trip.`userId`) as ptrip
  on ptrip.userId = users.id
left join
  (select count(driver_trip.id) as dtrip_num, driver_trip.`userId` from driver_trip where driver_trip.status != 'cancel' group by driver_trip.`userId`) as dtrip
  on dtrip.userId = users.id
where company != 'our company name' and company != 'our company nickname'

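This one returned within 5 seconds. That is what one would normally expect: filtering data on the order of 100,000 rows should take only a few seconds!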
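One behavioral difference between this version and the scalar-subquery version is worth keeping in mind: a scalar count subquery yields 0 for a user with no trips, while a left join to a pre-aggregated derived table yields NULL for that user. A minimal sketch of this, and of normalizing the value with coalesce, follows. The sketch_ tables and their rows are hypothetical, made up for this illustration.

```sql
-- Hypothetical miniature tables, used only for this illustration.
create table sketch_users (id int primary key);
create table sketch_trip (id int primary key, userId int, status varchar(16));

insert into sketch_users values (1), (2);        -- user 2 has no trips
insert into sketch_trip values (1, 1, 'done'), (2, 1, 'cancel');

select u.id,
       t.trip_num              as raw_num,    -- 1 for user 1, NULL for user 2
       coalesce(t.trip_num, 0) as trip_num    -- 1 for user 1, 0 for user 2
from sketch_users u
left join (
    -- Aggregate first, so the join sees at most one row per user.
    select userId, count(id) as trip_num
    from sketch_trip
    where status != 'cancel'
    group by userId
) as t on t.userId = u.id
order by u.id;
```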

The reason for the difference is actually quite simple: the clauses of a SQL statement are evaluated in a defined logical order.

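from: select the source tables (applying any joins) to form a result set.
where: filter that result set, keeping only the needed rows to form a new result set.
group by: group the new result set.
having: keep only the groups that are wanted.
select: choose the columns.
order by: once everything else is done, sort the final result.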
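The order above can be traced on a small query. This is the logical order in which the clauses are defined to apply; the optimizer is free to choose a different physical plan that yields the same result. The order_demo table and its rows are hypothetical, made up for this sketch.

```sql
-- Hypothetical table, used only for this illustration.
create table order_demo (userId int, status varchar(16));
insert into order_demo values
  (1, 'done'), (1, 'done'), (1, 'cancel'), (2, 'done'), (3, 'cancel');

select userId, count(*) as trip_num   -- 5. select: choose the output columns
from order_demo                       -- 1. from: start with all 5 rows
where status != 'cancel'              -- 2. where: 3 rows remain
group by userId                       -- 3. group by: groups for userId 1 and 2
having count(*) >= 2                  -- 4. having: only userId 1 (2 rows) remains
order by userId;                      -- 6. order by: sort what is left
-- Result: a single row, userId = 1, trip_num = 2.
```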

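With the first form, the direct join, filtering and grouping are applied to the joined row set, which for two tables of 100,000 rows each is conceptually a filter over as many as 10 billion (100,000 × 100,000) row combinations.
The latter two forms evaluate the subqueries first, each a query on the order of 100,000 rows, and then perform a single join against the 100,000-row main table, so the amount of data handled is orders of magnitude smaller than in the first form.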
